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- CRYSTAL 

FIELD OF TNVENTTON - 

The present invention relates to a crystal. In particular, the present invention relates 
to a crystal of ACE protein. 

The present invention further relates to methods, processes, ACE modulators, 
pharmaceutical compositions and uses of the ACE crystal and the structure co- 
ordinates thereof. 

BACKGROUND TO THE INVENTION 

Angiotensin converting enzyme (peptidyl dipeptidase A, EC 3.4.15.1, ACE) is a 
membrane-anchored dipeptidyl carboxypeptidase that is essential for blood pressure 
regulation and electrolyte homeostasis via the renin-angiotensin-aldosterone system. 
The enzyme is a zinc metalloprotease that converts the inactive peptide angiotensin I 
to angiotensin H, a potent vasoconstrictor. ACE, like many diverse membrane-bound 
ectoproteins, is released from the membrane by a membrane protease or secretase (1). 
An understanding of this cleavage-secretion mechanism is important for the 
development of therapeutic -strategies to address the different pathologies caused by 
defects in the function of the secretase. Substrate determinants that specify cleavage 
by secretases remain incompletely characterised, but may include the physico- 
chemical properties of the juxtamembrane ("stalk") sequence or unidentified 
recognition motifs of the stalk or the extracellular domain. Cleavage of ACE occurs 
in the stalk sequence and the solubilizing protease is constrained topologically, in 
terms of the number of residues between the cleavage site and the proximal 
extracellular domain of ACE. However, the ACE secretase appears to be remarkably 
versatile in terms of its substrate specificity (2,3). 

The active sites of ACE and carboxypeptidase A, a prototypic zinc metalloprotease, 
are understood to be very similar and this similarity is exploited in the design of the 
first generation of ACE inhibitors. The clinical success of these inhibitors - such as 
captopril and enalapril - in the treatment of hypertension and congestive heart failure 



is well established. However, the side effects such as persistent cough which effects, 
up to 20 % of patients and angioedema which is less common, together with 
limitations such as their contraindication in patients with impaired renal function and 
decreased efficacy in patients with low-renin hypertension, underscore the need for 
i more specific and selective inhibitors. 

There are two isoforms of ACE that are transcribed from the same gene in a tissue 
specific manner. In somatic tissues it exists as a glycoprotein composed of a single 
large polypeptide chain of 1277 amino acids whereas the germinal form is synthesised 

3 as a lower molecular mass isozyme and is thought to play a role in sperm maturation 
and me binding of sperm to the oviduct epithehum. The somatic form consists of two 
domains (N- and C-domain), each containing an active site with a conserved HEXXH 
zinc-binding motif and a glutamate some 24 residues downstream which forms the 
third zinc ligand (Williams et al., 1994). The two domains differ in their substrate 

5 specificities; inhibitor and chloride activation profiles; and physiological functions. 
There are two N-domain-specific substrates: the hemoregulatory peptide N-acetyl- 
seryl-aspartyl-lysyl-proline (AcSDKP) which controls hematopoietic stem cell 
differentiation and proliferation; and the bradykinin potentiating peptide angiotensin- 
(1-7). On the other hand, both active sites catalyse the hydrolysis of angiotensin I and 

10 the vasodilator bradykinin with similar efficiency. However, inhibition of the N- 
domain with a phosphinic peptide RXP407 has np effect on blood pressure regulation 
(Junot et al., 2001) and, furthermore, expression of the N-domain only, in transgenic 
mice produced a phenotype similar to the ACE knockout mice (Esther et al., 1997). 
Thus, the C-domain appears to be necessary and sufficient for controlling blood 

25 pressure and cardiovascular function. Testis ACE (tACE) is identical to the C- 
terminal half of somatic ACE, except for a unique 36-residue sequence constituting its 
amino terminus, thus this isoform is selected for initial efforts to obtain a three- 
dimensional structure. 

30 The cDNA sequence of human testicular ACE has been described (Ehlers et al. (1989) 
Proc. Nat. Acad. Sci. 86: 7741-7745) and the predicted protein consists of a 732-residue 
preprotein including a 31-residue signal peptide. The mature polypeptide has a 
molecular weight of 80,073 (unglycosylated form). 




Despite the pivotal role of ACE, there have been no reports disclosing that suitable 
crystals have ' been or could be obtained for this enzyme and so the X-ray 
crystallographic analysis of such proteins has been impossible. 

5 SUMMARY OF THE INVENTION 

The present invention is based upon the seminal finding of the first three-dimensional 
structure of the ACE protein. 

10- Peptidases, for example, thermolysin and carboxypeptidase A that have been used in 
comparative . molecular field analysis and 3D quantitative structure-activity 
relationship studies of ACE (Waller & Marshall, 1993) show no homology -with the 
structure of ACE described herein. .. Surprisingly, ACE shows significant structural 
homology with that of neurolysin, a member of the M3 family of tiiimet 

15 oligopeptidases. The two proteins do not share any amino acid sequence identity 
(close to random score), yet when the two structures are optimally superimposed 
using DALI server (Holm & Sander, 1999), there is noticeable match with a root 
mean square (r.m.s.) deviation of 4.0 A for 143 C a atoms. Accordingly, the structure 
presented herein, may be used for the development of novel, highly selective ACE 

20 modulators with the potential for greater efficacy, fewer side effects and treatment of 
new indications. 

Thus, in a first aspect, the present invention relates to a crystal of ACE protein. 

25 Preferably, the ACE protein is underglycosylated. 

Preferably, the ACE protein is underglycosylated by removing one or more 
glycosylation sites and/or one or more partially glycosylated sites. More preferably, 
the underglycosylated ACE protein is deglycosylated at amino acids 337 and 586 or 
30 amino acids 90, 109, 155, 337 and 586 of SEQ ID No 2. 



Preferably, the crystal comprises atoms arranged in a spatial relationship represented 
by at least a portion of the structure co-ordinates of Table A or Table B. 



Preferably, the crystal belongs to the space group P2i2i2i . 

Preferably, the crystal has the unit cell dimensions: a= 56.47 A, b=84.90 A and 
c=133.99 A. 

Preferably, the crystal is a crystal of human ACE protein. 

Preferably, the crystal further comprises an entity bound to the ACE protein or a 
portion thereof. More preferably, the entity is bound to the ACE protein or a portion 
thereof by contacting one or more residues of the ACE protein selected from: His384, 
Ala385, Lys542, Tyr551, Tyr554, Glu415 and His544. More preferably, the entity 
modulates the activity of ACE. More preferably, the entity is an inhibitor of ACE - 
such as lisinopril or a derivative thereof. 

In a second aspect, the present invention relates to a method of preparing a crystal of 
ACE protein comprising the steps of: (a) culturing host cells comprising an 
underglycosylated ACE protein; (b) purifying the underglycosylated ACE protein; 
and (c) crystallising the underglycosylated ACE protein. 

Preferably, the ACE protein is underglycosylated by removing one or more 
glycosylation sites and/or one or more partially glycosylated sites. More preferably, 
the underglycosylated ACE protein is deglycosylated at amino acids 337 or amino 
acids 90, 109, 155, 337 and 586 of SEQ ID No 2. 

Preferably, the ACE protein is crystallised using about 10 mM HEPES and about 
0.1% PMSF with an equal volume of a reservoir solution containing about 15 % PEG 
4000, about 50 mM CH 3 COONa.3H 2 0 pH 4.7 and about 10 uM ZnS0 4 .7H 2 0. 

Preferably, the crystal that is prepared has a structure defined by at least a portion of 
the structure co-ordinates of Table A or Table B. 

Preferably, the crystal belongs to the space group P2i2i2i- 



Preferably/ the crystal has the unit cell dimensions: a= 56:47 A, b=84.90 A and 
C-133.99A. "' r '" ;r : ' ' * 

Preferably, the ACE protein is human ACE protein. 

Preferably, the crystal further comprises an entity bound to the ACE protein. More 
preferably, the entity modulates the activity of ACE. More preferably, the entity is an 
inhibitor of ACE - such as lisinopril or a derivative thereof. More preferably, the 
crystal that is prepared has a structure defined by at least a portion of the structure co- 
ordinates of Table B. 

In a third aspect, the present invention relates to a method of screening for a 
modulator of ACE wherein the method comprises the use of a crystal according to the- 
present invention. Preferably, the method comprises the steps of: (a) providing at 
least a portion of the structure co-ordinates of Table A or Table B; (b) employing at 
least a portion of the structure co-ordinates of Table A or Table B to design or select 
or synthesise a putative modulator of ACE; (c) contacting the putative modulator of 
ACE with ACE or a mutant, variant, homologue, derivative or fragment thereof in the 
presence of a substrate; and (d) screening the putative modulator of ACE in an assay 
for the potential to modulate ACE. 

Preferably, at least a portion of the structure co-ordinates of Table A or Table B and/or 
the putative modulator of ACE and/or the substrate are provided on a machine-readable 
data storage medium comprising a data storage material encoded with machine 
readable data. 

Preferably, the putative ACE modulator is from a library of compounds. . 
Preferably, the putative ACE modulator is selected from a database. 
Preferably, the putative ACE modulator is designed de novo. 
Preferably, the putative ACE modulator is designed from a known ACE modulator. 



Preferably, the design or selection of the putative ACE modulator is performed in 
conjunction with computer modelling. 

Preferably, the putative ACE modulator is useful in the prevention and/or treatment of an 
ACE related disorder. More preferably, the ACE related disorder is hypertension. 

In a fourth aspect, the present invention relates to a process comprising the steps of: (a) 
performing the method according to the third aspect of the present invention; (b) 
identifying one or more modulators of ACE; and (c) preparing a quantity of those one or 
more ACE modulators. 

In a fifth aspect, the present invention relates to a process comprising the steps of: (a) 
performing the method according to the third aspect of the present invention; (b) 
identifying one or more ACE modulators; and (c) preparing a pharmaceutical 
composition comprising those one or more identified ACE modulators. 

In a sixth aspect, the present invention relates to a process comprising the steps of: (a) 
performing the method according to the third aspect of the present invention; (b) 
identifying one or more ACE modulators; (c) modifying those one or more ACE 
modulators; and (d) optionally preparing a pharmaceutical composition comprising 
those one or more ACE modulators. 

In a seventh aspect, the present invention relates to a method of obtaining structural 
information about a molecule or a molecular complex of unknown structure by using at 
least a portion of the structure co-ordinates of ACE, comprising the steps of: (a) 
generating X-ray diffraction data from a crystallised molecule or molecular complex; (b) 
applying at least a portion of the structure co-ordinates of ACE to said X-ray diffraction 
pattern to generate a three dimensional electron density map of at least a portion of the 
molecule or molecular complex; and (c) using all or a portion of the structure co- 
ordinates of ACE to generate homology models of ACE. 
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In an eighth aspect, the present invention relates to an ACE" modulator identified by 
the method according to the third aspect of the present invention. Preferably, the ACE" 
modulator inhibits ACE. 

5 In a ninth aspect, the present invention relates to a pharmaceutical composition 
comprising an ACE modulator according to the seventh aspect of the present 
invention and a pharmaceutically acceptable carrier, diluent, excipient or adjuvant or 
any combination thereof. 

10 In a tenth aspect, the present invention relates to a method of preventing and/or 
treating an an ACE related disorder comprising administering a modulator of ACE 
according to the seventh aspect of the present invention and/or a pharmaceutical 
according to the eighth aspect of the present invention, wherein said modulator of 
ACE or said pharmaceutical is capable of causing a beneficial preventative and/or 

15 therapeutic effect. 

In an eleventh aspect, the present invention relates to a computer for producing a 
three-dimensional representation of ACE wherein said computer comprises: (a) a 
computer-readable data storage medium comprising a data storage material encoded 

20 with computer-readable data, wherein said data comprises the structure co-ordinates 
of ACE; (b) a working memory for storing instructions for processing said computer- 
readable data; (c) a central-processing unit coupled to said working memory and to 
said computer-readable data storage medium for processing said computer-machine 
readable data into said three-dimensional representation; and (d) a display coupled to 

25 . said central-processing unit for displaying said three-dimensional representation. 

In a twelfth aspect, the present invention relates to a machine-readable data storage 
medium comprising a data storage material encoded with machine-readable data, 
wherein the data is defined by at least a portion of the structure co-ordinates of ACE 
30 in Table A or Table B. 



In a thirteenth, aspect, the present invention relates to the use of an ACE crystal in the 
preparation of a medicament to prevent and/or treat an ACE related disorder. 
Preferably, the ACE related disorder is hypertension. 

5 In a fourteenth aspect, the present invention relates to the use of at least a portion of the 
structure co-ordinates of Table A or Table B to screen for modulators of ACE. 

In a fifteenth aspect, the present invention relates to the use of at least a portion of the 
structure co-ordinates of Table A or Table B to solve the structure of the crystalline 
10 form of any other protein with significant amino acid sequence homology to any 
functional domain of ACE. 

In a sixteenth aspect, the present invention relates to the use of at least a portion of the 
structure co-ordinates of Table A or Table B in molecular design techniques to design, 
1 5 select and synthesise modulators of ACE. 

In a seventeenth aspect, the present invention relates to the use of at least a portion of 
the structure co-ordinates of Table A or Table B in the development of compounds 
that can isomerise to reaction intermediates in the chemical reaction of a substrate or 
20 other compound that binds to ACE. 

In an eighteenth aspect the present invention relates to the use of at least a portion of 
the structure co-ordinates of Table A or Table B to screen small molecule databases 
for chemical entities or compounds that modulate ACE. 

25 

In a nineteenth aspect, the present invention relates to the use of at least a portion of the 
structure co-ordinates of Table A or Table B to solve the structure of the crystalline 
form of any other protein with significant amino acid sequence homology to any 
functional domain of ACE. Preferably, the structure of the crystalline form of any 
30 other protein with significant amino acid sequence homology to any functional 
domain of ACE is solved using molecular replacement 



In a twentieth aspect, the present invention relates to the expression vectors pLEN- 
tACEA36g(l, 2,3, 4) and pLEN- tACEA36g(l,3). 




DETAILED DESCRIPTION OF THE INVENTION • 



ACE PROTEIN 

5 

ACE (EC 3.4.15.1) is a peptidyldipeptide hydrolase. It catalyzes the hydrolysis of the 
penultimate peptide bond at the C-terminal end of a variety of acylated tripeptides and 
larger polypeptides having an unblocked alpha-carboxy group having a fyee C- 
terminus. The reactivity of the enzyme varies markedly depending on the substrate. 

10 

As used herein, the term "ACE protein" includes all vertebrate and mammalian forms 
and is intended to cover mutants, variants, homologues, derivatives and fragments 
thereof. Preferably, the mutants, variants, homologues, derivatives and fragments 
thereof have the activity of the naturally occurring ACE. 

15 

There are two isoforms of ACE that are transcribed from the same gene in a tissue 
specific manner. In somatic tissues, it exists as a glycoprotein composed of a single 
large polypeptide chain of 1277 amino acids whereas the germinal form is synthesised 
as a lower molecular mass isozyme and is thought to play a role in sperm maturation 

20 and the binding of sperm to the oviduct epithelium. The somatic form consists of two 
domains (N- and C-domain), each containing an active site with a conserved HEXXH 
zinc-binding motif and a glutamate some 24 residues downstream which forms the 
third zinc ligand (Williams et al., 1994). The two domains differ in their substrate 
specificities; inhibitor and chloride activation profiles; and physiological functions. 

25 There are two N-domain-specific substrates: the hemofegulatory peptide JV-acetyl- . 
seryl-aspartyl-lysyl-proline (AcSDKP) which controls hematopoietic stem cell 
differentiation and proliferation; and the bradykinin potentiating peptide angiotensin- 
(1-7). On the other hand, both active sites catalyse the hydrolysis of angiotensin I and 
the vasodilator bradykinin with similar efficiency. However, inhibition of the N- 

30 domain with a phosphinic peptide RXP407 had no effect on blood pressure regulation 
(Junot et al., 2001) and, furthermore, expression of the N-domain only, in transgenic 
mice produced a phenotype similar to the ACE knockout mice (Esther et al., 1997). 
Thus, the C-domain appears to be necessary and sufficient for controlling blood 
pressure and cardiovascular function. Testis ACE (tACE) is identical to the C- 



10 

terminal half of somatic ACE, except for a unique 36-residue sequence constituting its 
amino terminus. 

Further background teachings on ACE have been presented by Victor A. McKusick et 
al at http://www3.ncbi.nlm.nih.gov/Omim/searchomm The following 

information concerning ACE has been extracted from that source. 

Angiotensin I-converting enzyme or lcininase II, is a dipeptidyl carboxypeptidase that plays an 
important role in blood pressure regulation and electrolyte balance by hydrolyzing angiotensin 
I into angiotensin 31, a potent vasopressor, and aldosterone-stimulating peptide. The enzyme is 
also able to inactivate bradykinin, a potent vasodilator. 

Ehlers et al. (1989) Proc. Nat. Acad. Set 86: 7741-7745 determined the cDNA sequence for 
human testicular ACE. The predicted protein is identical, from residue 37 to its C terminus, to 
the second half or C-terminal domain of the endothelial ACE sequence. The inferred protein 
sequence consists of a 732-residue preprotein including a 31-residue signal peptide. The 
mature polypeptide has a molecular weight of 80,073. 

Howard et al. (1990) Mol Cell Biol 10: 4294-4302, found that the testis-specific form of 
ACE has its own promoter within intron 12, is encoded by the 3-prime region of the gene, and 
is found only in postmeiotic spermatogenic cells and sperm. 

Although angiotensin-converting enzyme has been studied primarily in the context of its role 
in blood pressure regulation, this widely distributed enzyme has many other physiologic 
functions. The ACE gene encodes 2 isozymes. The somatic ACE isozyme is expressed in 
many tissues, including vascular endothelial cells, renal epithelial cells, and testicular Leydig 
cells, whereas the testicular or germinal ACE isozyme is expressed only in sperm Ramaraj et 
al., (1998) J. Clin. Invest 102: 371-378. 

Brown et al. (1996) Clin. Pharmacol. Therapeutics 60: 8-13 found an association between the 
use of certain ACE inhibitors (lisinopril or enalapril vs captopril) and emergent angioedema in 
the African-American population of Tennessee. The adjusted relative risk of angioedema was 
4.5 (95% CI, 2.9-6.8) in blacks compared to whites. The African-American patients were 
more severely affected: 7 of the 8 patients admitted to the intensive care unit were black, as 
were all patients who required intubation. African-American users of ACE inhibitors tended to 
be younger and female when compared to their white counterparts. The rate of angioedema 
was highest within the first 30 days of use (5.79 per 1000 patient-years) compared to long- 
term use (1.04 per 1000 patient-years). 

Large-scale trials of therapy for heart failure showed improvements in outcome with ACE 
inhibitors and beta-blockers. These results led to the recommendation that all patients who 
have heart failure accompanied by a low ejection fraction and who can tolerate ACE inhibitors 
and beta-blockers should be treated with both agents. Exner et al. (2001) New Eng. J. Med 
344: 1351-1357 focused on the fact that black patients with heart failure have a poorer 
prognosis than white patients and performed a study comparing racial groups. They found that 
whereas therapy with enalapril is associated with significant reduction m the risk of 
hospitalization for heart failure among white patients with left ventricular function, it had no 
such effect in similar black patients. The explanation for the lesser response to the ACE 
inhibitor in black patients was not clear. 

Mattei et aL (1989) Cytogenet Cell Genet 51: 1041 assigned the ACE gene to 17q23 by in 
situ hybridization. Using a DNA marker at the growth hormone gene locus, which they 
characterized as 'extremely polymorphic' and which showed no recombination with ACE, 
Jeunemaitre et al. (1992) Nature Genet 1: 72-75, mapped ACE to 17q22-q24, consistent with 



the in situ hybridization mapping to 17q23. A demonstration of linkage between the ACE 
. locus and elevated blood pressure in a rat model of hypertension pointed to ACE as a 
candidate gene' in human hypertension. In studies of hypertensive families, they found no 
evidence to support linkage between the ACE locus and the disease, however. 

Krege et al. (1995) Nature 375: 146-148* investigated Hie role of the ACE gene in blood 
pressure control and reproduction using mice generated to carry an insertional mutation that 
was designed to inactivate both forms of Ace. All homozygous female mutants were found to 
be fertile, but the fertility of homozygous male mutants was greatly reduced. Heterozygous 
males but not females had blood pressures that were 15 to 20 mm Hg less than normal, 
. . although both male and female heterozygotes had reduced serum Ace activity. 

Although significant ACE activity is found in plasma, the majority of the enzyme is bound to 
tissue such as vascular endothelium. Esther et al. (1997) J. Clin. Invest 99: 2375-2385, used 
targeted homologous recombination to create mice expressing a form of ACE that lacks the C- 
terminal half of the molecule. This modified ACE protein was catalytically active but entirely 
secreted from cells. Mice that expressed only this modified ACE had significant plasma ACE 
activity but no tissue-bound enzyme. These animals had low blood pressure, renal vascular 
thickening, and a urine-concentrating defect. The phenotype was very similar to that of 
completely ACE-deficient mice previously reported, except that the renal pathology was less 
severe. These studies strongly supported the concept that the tissue-bound ACE is essential for 
the control of blood pressure and the structure and function of the kidney. 

ACE gene knockout mice lack both isozymes and exhibit low blood pressure, kidney 
dysfunctions, and male infertility. Ramaraj et al. (1998) J. Clin. Invest 102: 371-378, reported 
the use of a sperm-specific promoter and interbreeding of transgenic- and gene knockout mice 
for generating a mouse strain that expressed ACE only in sperm. The experimental mice 
maintained the kidney defects of ACE -/- mice, but unlike the knockout strain, the males were 
fertile. Thus, Ramaraj et al. (1998) established that the role of ACE in male fertility is 
completely dependent on its exclusive expression in sperm. Their study demonstrated how 
transgenic and knockout techniques can be combined for ascribing a specific physiologic 
function to the expression of a multifunctional protein in a given tissue. 

Hagaman et al. (1998) Proc. Nat Acad. Set 95: 2552-2557 used transgenic mice lacking 
somatic and testis ACE to investigate the male fertility defect. They demonstrated that mice 
lacking both somatic and testis ACE isozymes have defects in sperm transport within the 
oviducts and in binding to zonae pellucidae. Males generated by gene targeting experiments 
that lack somatic ACE but retain testis ACE are fertile. Both male and female mice lacking 
angiotensinogen have normal fertility. The authors found that males heterozygous for the 
mutation inactivating both ACE enzymes had offspring of wildtype and heterozygous, 
genotypes at the same frequency, suggesting that sperm carrying the mutation are not at a 
selective disadvantage. • 

Nephropathy of type I diabetes is associated with the D allele of the insertion/deletion QJD) 
polymorphism in intrbn 16 of the ACE gene. The D allele determines higher enzyme levels. 
To address causality underlying this association, Huang et al. (2001) Proc. Nat Acad Sci. 98: 
13330-13334 induced diabetes in mice having 1, 2, or 3 copies of the Ace gene, normal blood 
pressure, and an enzyme level range (65-162% of wildtype) comparable to that seen in 
humans. Twelve weeks later, the 3-copy diabetic mice had increased blood pressures and 
overt proteinuria. Proteinuria was correlated to plasma ACE level in the 3-copy diabetic mice. 
Thus, a modest genetic increase in ACE levels was sufficient to cause nephropathy in diabetic, 
mice. 



CRYSTAL 



In one aspect of the present invention, there is provided a crystal of ACE protein. 



12 



As used herein, the term "crystal" means a structure (such as a three dimensional (3D) 
solid aggregate) in which the plane faces intersect at definite angles and in which 
there is a regular structure (such as internal structure) of the constituent chemical 
species. Thus, the term "crystal" can include any one of: a solid physical crystal form 
such as an experimentally prepared crystal, a 3D model based on the crystal structure, 
a representation thereof - such as a schematic representation thereof, a diagrammatic 
representation thereof, or a data set thereof for a computer. 

V 

The crystals of the present invention may be prepared by purifying ACE protein and 
then crystallising the purified protein. The ACE protein may also be prepared by 
' expressing a nucleotide sequence encoding the ACE protein in a suitable host cell. 

In a preferred embodiment, the crystals of the present invention are prepared by 
purifying underglycosylated ACE protein and then crystallising the purified 
15 underdeglycosylated protein. The underdeglycosylated ACE protein may also be 
prepared by expressing a nucleotide sequence encoding the underdeglycosylated ACE 
protein in a suitable host cell. 

ACE may be purified using various methods known to a person skilled in the art, for 
oo example, from conditioned media by affinity chromatography on a Sepharose-28- 
lisinopril affinity resin (Yu et al. 1997). The protein may be quantified by amino acid 
analysis and assayed for activity using the substrate hippuryl-L-histidyl-L-leucine, as 
described previously (Ehlers, MRE, Chen, Y -N, Riordan, JF (1991) Proc. Natl. Acad 
Sci. USA 88, 1009-1013). 

25 

The purified ACE proteins may be stored at -20 °C in 10 mM HEPES and0.1%PMSF. 

Concentration may be performed with the aid of a filtration system and the protein 
concentrate may be immediately used for crystallisation purposes. The protein 
30 concentrate may be crystallised using, for example, the vapour diffusion hanging drop 
method at a temperature of from about 1 °C to about 30 °C, preferably from about 4 °C 
to about 20 °C, more preferably at about 16 °C. The crystallisation temperature may be 
dependent on the additives present in the protein solution. 
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Typically, the best crystals for ACE proteins are grown at 16 °C by the vapour 
diffusion hanging drop method by mixing 2 uJ of the protein solution at ~1 1 .5 mg/ml 
in 10 mM HEPES and 0.1% PMSF with an equal volume of a reservoir solution 
containing 15 % PEG 4000, 50 mM CH 3 COONa.3H 2 0 pH 4.7 and 10 uM 
ZnS0 4 .7H 2 0. Crystals usually appear within 2 weeks and grow to their maximum 
size after about a month. 

The design of ACE inhibitors has been based upon the assumption that the structure 
of ACE is related to that of peptidases - such as thermolysin (MA clan, M2 family) 
and carboxypeptidase A (MC clan, M14 family) as evidenced by comparative 
molecular field analysis and 3D quantitative structure-activity relationship studies of 
ACE (Waller & Marshall, 1993). Marshall & Cramer (1988) Trends Pharmacol. 
Sci. 9, 285-289 have also reported the development of predictive models for inhibitors 
of ACE and thermolysin. Surprisingly, the structure of ACE described herein, shows 
no structural homology with thermolysin but shows significant structural homology 
with neurolysis a member of the M3 family of thimet oligopeptidases. Therefore, the 
structure presented herein, may be effectively used for the development of novel, 
highly selective ACE modulators with the potential for greater efficacy, fewer side 
effects and treatment of new indications. In addition, the unanticipated similarity with 
neurolysin has shown the structural conservation amongst an merging family of 
peptidases with a common evolutionary origin. 

Without wishing to be bound by theory, it appears that the core structure of the two 
proteins is highly similar with different loop structures on the outer surface in the case 
of neurolysin. Indeed like ACE, neurolysin also belongs to the family of 
metallopeptidases bearing the HEXXH active" site motif (Rawlings and Barrett, 1995; 
Brown et a!., 2001), The striking similarity also extends to the active site region in 
neurolysin consisting of a deep narrow channel that divides the molecule into two 
halves. It has been speculated that using the flexible secondary structure elements in 
the active site cavity, the neuropeptidase can effectively cleave a variety of small 
peptides. Likewise in ACE, the geometry of the active site groove clearly accounts for 
ACE's inability to hydrolyse large; folded substrates. Furthermore, the enzyme's 
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preference for oligopeptide substrates of about thirteen residues" or less suggests that 
the substrate does not have the same freedom to extend outside of the channel during 
catalysis. Peptidases - such as thermolysin (MA clan, M4 family) and 
carboxypeptidase A (MC clan, Ml 4 family) - which have been used in comparative 
5 molecular field analysis and 3D quantitative structure-activity relationship studies 
(Waller & Marshall, 1993) show no structural homology with ACE. 

A crystal according to any one of the preceding claims comprising atoms arranged in 
a spatial relationship represented by at least a portion of the structure co-ordinates of 
10 Table A or Table B. 

Preferably, the crystal belongs to the space group V2\2\2\ and has a unit cell with 
dimensions of: a- 56.47 A, b=84.90 A, c=133.99 A. 

15 Preferably, the crystal is a crystal of human ACE protein. 

Complexes may be obtained by growing the crystals in the presence of an entity - 
such as a test compound. In these experiments the protein solution is mixed with the 
entity and an equal volume of the reservoir solution before setting up the 
20 crystallisation. Single crystals suitable for diffraction work typically appear after 
about 4 weeks. 

Typically, the protein comprising ACE is purified to homogeneity for crystallisation. 
Purity of ACE may be measured by typical techniques such as SDS-PAGE, mass 
25 spectrometry and hydrophobic HPLC. 

The crystal structure of the invention may contain a portion - such as at least 25%, at 
least 50%, at least 75%, or preferably at least 90%, at least 95%, at least 98%, or at 
least 99% - of the co-ordinates listed in Table A or Table B. Preferably, the crystal 
30 structure of the invention contains all of the co-ordinates listed in Table A or Table B. 



Preferably, the crystal is usable in X-ray crystallography techniques. 
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Preferably, the crystals used can withstand~exposure to X-ray beams' used to produce 
diffraction pattern data necessary to solve the X-ray crystallographic structure. 

Preferably, prior to data collection, the crystals are flash-cooled at about 100 K in a 
cryoprotectant. More preferably, cryoprotectant contains 15 % PEG 4000, 50 mM 
CH 3 COONa.3H 2 0 pH 4.7, 10 pM ZnS0 4 .7H 2 0 and 25% glycerol. 

The X-ray data may be collected at a Synchrotron Radiation Source. Preferably, the 
X-ray data are collected at a Synchrotron Radiation Source at 100 °K. 

Preferably, the crystal has a resolution determined by X-ray crystallography of about 
3.5A or less, more preferably a resolution of about 2.8A or less, more preferably, a 
resolution of about 2 A or less, more preferably, a resolution of about 1 .5A or less, most 
preferably, 1 A or less. 

DEGLYCOS YLATION AND UNDERGLYCOSYLATIQN 



Many proteins in eukaryotic cells are glycoproteins that contain oligosaccharide 
chains covalently linked to certain amino acids. Glycosylation is known to affect 
20 protein folding, localisation and trafficking, protein solubility, antigenicity, biological 
activity and half-life, as well as cell-cell interactions. 

Protein glycosylation can be divided into four main categories mainly depending on the 
linkage between the amino acid and the sugar. These are N-linked glycosylation, O- 
25 linked glycosylation, C-mannosylation and GPI anchor attachments. N-glycosylation is 
characterised by the addition of a sugar to the amino group of an asparagine. In O- 
glycosylation, a sugar is attached to the hydroxyl group of a serine or threonine residue. 

For N-glycosylation, the sequence motif Asn-Xaa-Ser/Thr (wherein Xaa is any amino 
30 acid other than Pro) has been defined as a prerequisite for glycosylation. Although rare, 
the sequence motif Asn-Xaa-Cys can also be an acceptor site. N-glycans can be 
subdivided into three distinct groups called 'high mannose type 1 , 'hybrid type', and 
'complex type', with the common pentasaccharide core - Manp(alphal,6)- 
(Manp(alphal,3))-Manp(betal,4)-GlcpNAc(betal,4) GlcpNAc(betal,N>Asn - occuring 
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in all three groups. The relationship between all three types can be ascribed to the fact 
that they originate from one precursor oligosaccharide which contains the described 
common pentasaccharide core Man3-GlcNAc2, and some additional sugar residues and 
the non-reducing end, and is then processed enzymatically to yield these three types. 
Since the hydroxyl group of Ser/Thr is thought to be involved in hydrogen bonding 
during the enzymatic attachment of the oligosaccharide precursor molecule to yield a 
favourable conformation for the action of the ohgosacchaiyltransferase, it has been 
suggested for proline that the steric hindrance might be too large (Kornfeld (1985) Ann. 
Rev. Biochem. 54: 631-64), preventing glycosylation at Pro containing sites. The 
negative influence of aspartic acid towards glycosylation can be ascribed to the negative 
charge on the side chain of this residue. In addition some cases have been reported 
where Ser/Thr is replaced by cysteine. While Ser replacement by Cys generally leads to 
decreased glycosylation, it has been shown (Kasturi 1995 J. Biol. Chem. 270: 14756-61) 
that substitution by Thr at a given potential glycosylation site can lead to increased 
glycosylation. This is in accordance with the model of hydrogen bonding being an 
important factor during the attachment of the precursor molecule to the protein. 
Although there are usually many potential glycosylation sites in a protein it has been 
estimated that glycosylation occurs only at one third of them. Mostly at those sites where 
the surrounding amino acids allow the formation of a beta turn. 

Various glycoforms of ACE have been described. By way of example, Sadhukhan & 
Sen (3a) disrupted specific glycosylation sites in rabbit tACE to elucidate the 
glycosylation requirements for the expression and processing of active testis ACE. 
There are five potential N-linked glycosylation sites in the rabbit tACE sequence, with 
an additional six in the somatic form (4a). A null mutant, where all five sites had 
been disrupted, behaved similarly to wild-type tACE expressed in the presence of the 
glycosylation-inhibitor, tunicamycin. It was degraded intracellularly and failed to be 
detected in culture medium, confirming previous findings that tACE requires N-linked 
glycosylation to be expressed in an active form (5a and 3a). Expression of the 
remaining mutants showed a preference for iV-linked glycosylation at the N-terminus 
and that the presence of sugars at a single N-terminal site was necessary and sufficient 
to produce enzymatically-active tACE that was solubilised. The presence of 
glycosylation is not site-specific, as mutants that have either the first site or second 
site intact are expressed and active. However, glycosylation at the third site alone is 
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not sufficient to produce active protein in HeLa cells, albeit this mutant was expressed 
in yeast (3a), indicating that the requirements for glycosylation aire cell-specific. 

In human testis ACE Olinked sugars are not necessary for expression, implicating the 
5 N-linked sugars in this role (6a). JV-linked glycosylation of human tACE expressed in 
CHO cells at each site has been identified by MALDI-TOF mass spectrometry (7a). 
There are seven potential JV-linked sites in human tACE, five of which are 
complementary to the sites in rabbit tACE (7a). The unique sites lie within the 
ectodomain (the fourth site) and in the juxtamembrane stalk region, adjacent to the 
10 cleavage site (the seventh site). As with the rabbit form, there appears to be a 
preference for glycosylation at the N-terminus as evidenced by MALDI-TOF mass 
spectrometry of glyosylation sites (7a). Inhibition of complex oligosaccharide 
formation using a. glucosidase I inhibitor iV-butyldeoxynojirimycin (NB-DNJ) led to 
the production of an active glycoform that was electrophoretically homogeneous (7a). 

15 

Suitably, the crystal of the ACE protein may comprise de-glycosylated ACE protein 
or a fragment thereof. For example, the deglycosylated ACE may comprise the 
sequence presented as SEQ ED No. 2. 

20 To deglycosylate the ACE protein, various methods known to a person skilled in the 
art may be used. Both chemical and enzymatic methods may be used for removing 
oligosaccharides from glycoproteins. Hydrazinolysis of glycoproteins (Kuraya, N & 
Hase (1992) J Biochem (Tokyo) 112:122-126), is capable of removing bothN- and O- 
linked sugars, although this results in the complete destruction of the protein 

25 component and is therefore not suitable if recovery of the protein is desirable. Milder 
chemical methods such as trifluoromethanesulphonic acid (TFMS) may be used, 
however this may result in incomplete sugar removal and partial protein destruction. 
Other methods — such as site directed mutagenesis of glycosylated amino acids may 
also be used. 

30 



Suitably, enzymatic methods may be used which provide for complete sugar removal 
with no protein degradation. 
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Use of Hie enzyme PNGase F is an effective method of removing virtually all N-linked 
oligosaccharides from glycoproteins (Tarentino & Plummer (1994). Methods in 
Enzymol 230: 44-57). The oligosaccharide is left intact and therefore suitable for further 
analysis (the asparagine residue from which the sugar was removed is deaminated to 
aspartic acid, the only modification to the protein). 

Other commonly used endoglycosidases include Endoglycosidase H (Kobata (1979) 
Anal Biochem 100:1-14) and Endoglycosidase F (Trimble & Tarentino (1991) J. 
Biochem. 266:1646-1651). In a preferred method, the ACE protein is digested .with 
Endoglycosidase H (30 mU) in a suitable buffer - such as 100 mM sodium phosphate, 
0.1 mM ZnCl 2 and 1% BSA, pH 6.0 for 16 h at 37°C. The endo H-treated protein is 
passed through a lectin affinity column consisting of equal parts of concanavalin A, 
wheat germ, and lentil lectin, after equilibration with 20 mM Tris-HCl, 0.5 M NaCl at 
pH 7.4. The deglycosylated ACE is collected in the flowthrough. Free 
oligosaccharides and any other impurities are removed from the flowthrough fraction 
by a final lisinopril-Sepharose affinity chromatography step. The homogeneity of the 
ACE protein after deglycosylation is confirmed by SDS-PAGE on a 4-20% 
acrylamide gel and MALDI-TOF mass spectrometry. 

Commercially available kits may also be used - such as the E-DEGLY kit (Sigma- 
• Aldrich, UK) and the GlycoFree Deglycosylation Kit (Glyco, Novato, USA) which 
removes bothN- and O-linked glycans from glycoproteins. 

Preferably, ACE is crystallised using underglycosylated ACE protein. 

As used herein, the term "underglycosylated" means that one or more of the 
oligosaccharide chains covalently linked to amino acids in the glycosylated protein 
are no longer present. 

By way of example only, testis ACE is glycosylated at amino acids 72, 90 and 109 
and glycosylated partially at amino acids 155, 337 and 586. Accordingly, the 
"underglycosylated" testis ACE may not be glycosylated a one or more of arnino 
acids 72, 90, 109, 155, 337 and 586. 
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The underglycosylated ACE protein may be prepared using various methods - such as 
site directed mutagenesis * and glycosylation inhibition methods. For example, 
glycosylation inhibition methods using NB-DNJ may prevent the formation of complex 
oligosaccharides. NB-DNJ inhibits glucosidase I and prevents maturation of the sugars. 

5 

Preferably, the underglycosylated ACE protein is prepared by culturing host cells in 
the presence of N-bxjtyldeoxynojirimycin (NB-DNJ). 

Preferably, underglycosylated ACE protein is prepared using site-directed 
10 mutagenesis. More preferably, the underglycosylated . ACE protein comprises a 
mutation at amino acid 337 of SEQ ID No. 2 or amino acids 90, 109, 155, 337 and 
586ofSEQIDNo. 2. 

Underglycosylated mutants may yield crystals in the orthorhombic, space 
15 group. that diffract to 2.8A or less. Moreover, crystals from a truncated mutant of 
tACE that only have simple high mannose oligosaccharides are also grown in the 
orthorhombic space group ¥2\l\l\ which may diffract to better than 2.0 A resolution. 
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PREPARING A CRYSTAL OF ACE PROTEIN 

In another aspect, the present invention relates to a method of preparing a crystal of ACE 
protein, comprising the steps of (a) culturing host cells comprising an underglycosylated 
ACE protein; (b) purifying the underglycosylated ACE protein; and (c) crystallising the 
underglycosylated ACE protein. 

Preferably, the ACE protein is underglycosylated by removing one or more 
glycosylation sites and/or one or more partially glycosylated sites. More preferably, 
the underglycosylated ACE protein comprises a mutation at amino acid 337 of SEQ ID 
No 2 or amino acids 90, 109, 155, 337 and 586 of SEQ ID No 2. 
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The ACE protein may be underglycosylated and purified using the methods described 
herein. 



20 



Preferably, the ACE protein is crystallised using about 10 mM HEPES and about 
0.1% PMSF with an equal volume of a reservoir solution containing about 1 5 % PEG 
4000, about 50 mM CH 3 COONa.3H 2 0 pH 4.7 and about 10 uM ZnS0 4 .7H 2 0. 

Preferably, the ACE protein is crystallised in the presence of an entity, for example, a 
modulator of ACE. 

MODULATORS OF ACE 

The role of ACE in the pathogenesis of hypertension has resulted in a search for 
modulators (eg. inhibitors) of the enzyme that could act as antihypertensive drugs (eg. 
US 3,891,616, US 3,947,575, US 4,052,511 and US 4,053,651). Therapeutic 
vasodepressors - such as Captopril and D-2-methyl-3-mercaptopropanoyl-L-proline - 
have been synthesised as ACE inhibitors. Numerous synthetic peptide derivatives have 
also been shown to be ACE inhibitors as disclosed in US 3,832,337. 

Natural substances that inhibit ACE include snake venom and those derived from 
foodstuffs - such as ACE inhibiting peptides produced by enzymatic hydrolysis of 
proteins, such as casein or fish meat protein (by Hiroyuki Ukeda, Nippon Nogei Kagaku 
Kaishi (Journal of Japan Society for Bioscience, Biotechnology, and Agrochemislry, 
66(1), 25-29 (1992)). 

ACE inhibiting synthetic substances include captopril (D-2-methyl-3- 
mercaptopropanoyl-L-proline) which has already been put to practical application as an 
orally administered vasodepressor. 

However, many ACE inhibiting substances exhibit side effects in many cases and special 
attention needs be exercised in safety aspects. 



The present invention permits the use of molecular design techniques to design, select 
and synthesise chemical entities and compounds, including ACE modulating 
compounds, capable of binding to ACE, in whole or in part. 



Thus, in a further aspect, the present invention relates to a method of screening for a 
modulator of ACE wherein the method comprises the use of a crystal of ACE. 

Preferably, the method comprises the steps of: (a) providing at least a portion of the 
structure co-ordinates of Table A or Table B; (b) employing at least a portion of the 
structure co-ordinates of Table A or Table B to design or select or synthesise a 
putative modulator of ACE; (c) contacting the putative modulator of ACE with ACE 
or a mutant, variant, homologue; derivative or fragment thereof in the presence of a 
substrate; and (d) screening the putative modulator of ACE in an assay for the 
potential to modulate ACE. 

By way of example, the structure co-ordinates may be used to design compounds that 
bind to the enzyme and may alter the physical properties of the compounds (eg. 
solubility) or the enzyme itself. This invention may be used to design compounds that 
act as modulators - such as competitive inhibitors - of ACE by binding to all or a 
portion of the active site of ACE. Compounds may also be designed that act as non- 
competitive inhibitors of ACE, These as non-competitive inhibitors may bind to all or 
a portion of ACE already bound to its substrate and may be more potent and specific 
than known ACE inhibitors that compete only for the ACE active site. Similarly, 
non-competitive inhibitors that bind to and inhibit ACE whether or not it is bound to 
another chemical entity may be designed using the structure co-ordinates of ACE as 
described herein. 

By way of example, it may be found that the COOH-binding active site residue differs 
between the N and C domain active sites and/or that it may be amenable to the 
incorporation of a functionality that can covalently modify this residue to produce an 
irreversible inhibitor design. It has long been assumed that the COOH-binding 
residue is a positively charged arginine (M. A. Ondetti & D. W. Cushman (1981) in 
Biochemical Regulation of Blood Pressure (R. L. Soffer, ed.), Wiley, New York, 165- 
204), but the ACE-lisinopril X-ray crystal structure described herein, shows that it is a 
lysine, in the case of the C-domain active site. This may present an opportunity for 
covalent modification, by, for example, the introduction of an alkyl halide or halo- 
ketone functionality into the inhibitors that can alkylate the lysine amine, or ot-ketone 




or aldehyde that can form a Schiffs base with the lysine amine, or the use of activated 
ester or thioester groups, or other modified carboxyl groups susceptible to 
nucleophilic attack. 

In a preferred embodiment, at least a portion of the structure co-ordinates of Table A 
or Table B and/or the putative modulator of ACE and/or the substrate are provided on 
a machine-readable data storage medium comprising a data storage material encoded 
with machine readable data. 

An ACE crystal may be probed with a variety of different chemical entities or test 
compounds to determine optimal sites for interaction between modulators of ACE and 
the enzyme. For example, X-ray diffraction data collected from crystals grown in the 
presence of chemical entities or test compounds may allow the elucidation of how the 
chemical entities or test compounds interact with ACE. Molecules that bind to those 
sites can then be designed and synthesised and tested for their ACE modulating 
activity. 

The present invention may also allow the development of compounds that can 
isomerise to reaction intermediates in the chemical reaction of a substrate or other 
compound that binds to ACE. Thus, the time-dependent analysis of structural 
changes in ACE during its interaction with other molecules may be performed. The 
reaction intermediates of ACE may also be deduced from the reaction product in co- 
complex with ACE. Such information is especially useful to design improved 
analogues of known ACE modulators or to design new ACE modulators based on the 
reaction intermediates of the ACE enzyme and ACE-modulator complex. This may 
provide a new route for designing ACE modulators with high specificity and stability. 
Preferably, this provides a new route for designing ACE modulators with high 
specificity, high stability and low toxicity. 

Small molecule databases or test compounds may be screened for chemical entities or 
compounds that can bind in whole, or in part, to ACE. Thus, in a preferred 
embodiment, the putative ACE modulator is from a library of compounds or a 
database. In this screening, the quality of fit of such entities or compounds to the 
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binding site may be judged by various methods - such as shape complementarity or 
estimated interaction energy (Meng, E. C. et al. 9 J. Comp. Chem,, 13, pp. 505-524 
(1992)). 

Because ACE protein or a mutant, variant, homologue, derivative or fragment thereof 
may crystallise in more than one crystal form, the structure co-ordinates of ACE, -or 
portions thereof, may be particularly useful to solve the structure of other crystal 
forms of ACE. They may also' be used to solve the structure of ACE mutants, ACE 
variants, ACE homologues, ACE derivatives, ACE fragments and ACE complexes. 



Preferably, the structure co-ordinates of ACE are used to solve the structure of the 
crystalline form of any other protein with significant amino acid sequence homology 
to any functional domain of ACE. By way of example, molecular replacement may 
be used. In this method, the unknown crystal structure, whether it is another crystal 

15 form of ACE, an ACE mutant, an ACE variant, an ACE homologue .(eg. another 
protein with significant amino acid sequence homology to any functional domain of 
ACE), an ACE derivative, an ACE fragments or an ACE co-complex may be 
determined using the ACE structure co-ordinates of the present invention. This 
method will provide a more accurate structural form for the unknown crystal more 

20 quickly and efficiently than attempting to determine such information ab initio. 

In a preferred embodiment of the present invention, the ACE crystal further comprises 
an entity bound to the ACE protein or a portion thereof. For example, ACE may be 
crystallised in complex with an entity that is an inhibitor of ACE eg. lisinopril. 

25 

Preferably, the entity is bound to the ACE protein or a portion thereof by contacting 
one or more residues of the ACE protein selected from: His384, Ala385, Lys542, 
Tyr551, Tyr554, Glu415 and His544. 

30 The crystal structures of a series of such complexes may then be solved by molecular 
replacement or in combination with MAD (Multiwavelength Anomalous Dispersion) 
and/or MIRAS (Multiple Isomorphous Replacement with Anomalous Scattering) 
procedures - and compared with that of wild-type ACE. Potential sites for 
modification within the binding sites of the enzyme may thus be identified. This 



information provides an additional tool for determining the most efficient binding 
interactions, for example, increased hydrophobic interactions, between ACE and a 
chemical entity or compound. 

The structures and complexes of ACE may be refined using computer software - such 
as X-PLOR (Meth. EnzymoL, vol. 114 & 115, H. W. Wyckoff et al., eds., Academic 
Press (1985)), MLPHARE (Collaborative computational project Number 4. The 
CCP4 Suite: Programs for Protein Crystallography (1994) Acta Crystallogr. D 50, 
760.-763) and SHARP [De La Fortelle, E. & Bricogne, G. Maximum-likelihood 
heavy-atom parameters refinement in the MIR and MAD methods (1997) Methods 
Enzymol. 276, 472-494). Preferably, the complexes are refined using the program 
CNS (Briinger et al. (1998) Acta Crystallogr. D 54, 905-921). During the final stages 
of refinement water molecules, ions and inhibitor molecules may be inserted in the 
structure. This information may thus be used to optimise known classes of ACE 
modulators, eg. ACE inhibitors, and more importantly, to design and synthesise novel 
classes of ACE modulators. 

The overall figure of merit may be improved by iterative solvent flattening, phase 
combination and phase extension with the program SOLOMON [Abrahams, J. P. & 
Leslie, A. G. W. Methods used in structure determination of bovine mitochondrial Fl 
ATPase. (1996) Acta Crystallogr. D 52, 1.10-119]. 

The structure co-ordinates of ACE mutants provided in this invention also facilitate 
the identification of related proteins or enzymes analogous to ACE in function, 
structure or both, thereby further leading to novel therapeutic modes for treating or 
preventing ACE related diseases. 

The design of compounds that bind to or modulate ACE according to the present 
invention generally involves consideration of two factors. First, the compound must 
be capable of physically and structurally associating with ACE. Non-covalent 
molecular interactions important in the association of ACE with its substrate may 
include hydrogen bonding, van der Waals and hydrophobic interactions. Second, the 
compound must be able to assume a conformation that allows it to associate with 
ACE. Although certain portions of the compound may not directly participate in the 



association with ACE, those portions may still influence the overall conformation of 
the molecule. This may have a significant impact on potency. ' Such conformational 
requirements include the overall three-dimensional structure and orientation of the 
chemical entity or compound in relation to all or a portion of a binding site of ACE, or 
the spacing between functional groups of a compound comprising several chemical 
entities that directly interact with ACE. 

The potential modulating or binding effect of a chemical compound on ACE may be 
analysed prior to its actual synthesis and testing by the use of computer modelling 
techniques. If the theoretical structure of the given compound suggests insufficient 
interaction and association with ACE, then synthesis and testing of the compound 
may be obviated. However, if computer modelling indicates a strong interaction, the 
molecule may be synthesised and tested for its ability to bind to ACE and modulate 
(eg. inhibit) using the fluorescent substrate assay of Thomberry et al (2000; Methods 
EnzymoL 322, pp 100-110.. In this manner, synthesis of inactive compounds may be 
avoided. 

A modulating or other binding compound of ACE may be computationally evaluated 
and designed by means of a series of steps in which chemical entities or test 
compounds are screened and selected for their ability to associate ACE. 

A person skilled in the art may use one of several methods to screen chemical entities 
or test compounds for their ability to associate with ACE and more particularly with 
the individual binding sites of ACE. This process may begin by visual inspection of, 
for example, the active site on the computer screen based on the ACE co-ordinates of 
the present invention. Selected chemical entities or test compounds may then be 
positioned in a variety of orientations, or docked, with ACE. Docking may be 
accomplished using software such as Quanta and Sybyl, followed by energy 
minimisation and molecular dynamics with standard molecular mechanics force fields 
- such as CHARMM and AMBER. 

Specialised computer programs may also assist in the process of selecting chemical 
entities or test compounds. These include but are not limited to MCSS (Miranker and 
Karplus (1991) Proteins: Structure, Function and Genetics, 11, pp. 29-34); GRID 
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(Goodford (1985) J. Med. Chem, 28, pp. 849-857) and AUTODOCK (Goodsell and 
Olsen (1990), Proteins: Structure. Function, and Genetics, 8, pp. 195-202. 

Once suitable chemical entities or test compounds have been selected, they may be 
5 assembled into a single compound - such as an ACE modulator. Assembly may 
proceed by visual inspection of the relationship of the chemical entities or test 
compounds in relation to the structure co-ordinates of ACE. This may be followed by 
manual model building using software - such as Quanta, Sybyl or O [Jones, T. A, 
Zou, J. Y., Cowan, S. W. & Kjeldgaard, M. Improved methods for building protein 
10 models in electron density maps and the location of errors in these models (1991) 
Acta Crystallogr. A 47, 110-119]. 



Refinement of the model may be carried out using the program CNS [Brttnger, A. T. 
et al. Crystallography & NMR System: A new. software suite for macromolecular 
15 structure determination. (1998) Acta Crystallogr. D 54, 905-921]. 



Various programs may be used by a skilled person to connect the individual chemical 
entities or test compounds - such as 3D Database systems (Martin (1992) J. Med. 
Chem., 35, pp. 2145-2154) and CAVEAT (Bartlett et al. (1989) Royal Chem. Soc. 78, 
20 pp. 182-196). 

Rather than build an ACE inhibitor one chemical entity at a time, modulating or other 
ACE binding compounds may be designed as a whole or de novo using either an 
empty binding site or optionally including some portion(s) of a known inhibitors). 
25 Such compounds may be designed using programs that may include but are not 
limited to LEGEND (Nishibata and Itai (1991) Tetrahedron, 47, p. 8985) and LUDI 
(Bohm (1992) J. Comp. Aid. Molec. Design, 6, pp. 61-78). 

Other molecular modelling techniques may also be employed in accordance with mis 
30 invention - such as those described by Cohen et al., J. Med. Chem., 33, pp. 883-894 
(1990); Navia and Murcko (1992) Current Opinions in Structural Biology, 2, pp. 202- 
210 (1992). 



Once a compound has been designed or selected by the above methods, the efficiency 
with which that compound may bind to ACE may be computationally evaluated. 
Specific computer software may be used to evaluate the efficiency of binding (eg. to 
evaluate compound deformation energy and electrostatic interaction) — such as 
5 QUAOTA/CHARMM (Accelrys Inc., USA) and Insight n/Discover (Biosym 
Technologies Inc., San Diego, Calif, USA). These programs may be implemented, 
for instance, using a suitable workstation. Other hardware systems and software 
packages will be known to those persons skilled in the art. 

10 Once an ACE-modulating compound has been selected. or designed, as described 
above, substitutions may be made (eg. in atoms or side groups) to improve or modify 
the binding properties. The substitutions may be conservative ie. the replacement 
group may have approximately the same size, shape, hydrophobicity and charge as the 
original group. Such substituted chemical compounds may then be analysed for 

15 efficiency of binding to ACE by the same computer methods described above. 

Test compounds and modulators of ACE etc. which are identified using the crystal 
and the methods of the present invention may be screened in assays. Screening can 
be, for example in vitro, in cell culture, and/or in vivo. Biological screening assays 
20 preferably centre on activity-based response models, binding assays (which measure 
how well a compound binds), and bacterial, yeast and animal cell lines (which 
measure the biological effect of a compound in a cell). The assays can be automated 
for high capacity-high throughput screening (HTS) in which large numbers of 
compounds can be tested to identify compounds with the desired activity. 
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Current screening technologies are described in Handbook of Drug Screening, edited 
by Ramakrishna Seethala, Prabhavathi B. Fernandes. New York, NY, Marcel Dekker, 
(2001). 



30 ACE RELATED DISORDERS 



ACE related disorders include, but are not limited to, treatment of high blood pressure; 
treatment of heart failure; prolonging survival. of patients who have had a heart attack; 
preventing death by heart attack and stroke in patients with vascular disease and in 
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diabetics with other vascular risk factors; prolonging survival of patients with weak heart 
muscle; helping leaking heart valves; preserving kidney function in diabetics; and the 
treatment of new indications (e.g. polycythemia). Special groups of patients may also be 
treated with ACE inhibitors, including: patients with chronic pulmonary disease; patients 
with schleroderma; patients with atheroschlerosis; and patients with hyperuricemia. 

ACE CONSTRUCTS 

The ACE proteins produced by a host recombinant cell may be secreted or may be 
contained intracellularly depending on the nucleotide sequence and/or the vector used. 

As will be understood by those of skill in the art, expression vectors containing an ACE 
encoding nucleotide sequence or a mutant, variant, homologue, derivative or fragment 
thereof, may be designed with signal sequences which direct secretion of the ACE 
coding sequences through a particular prokaryotic or eukaryotic cell membrane. 

The ACE encoding sequence may be fused (eg. ligated) to nucleotide sequences 
encoding a polypeptide domain which will facilitate purification of soluble proteins 
(Kroll DJet al (1993) DNA Cell Biol 12:441-53). Preferably, the polypeptide domain 
which facilitates purification of soluble proteins is fused in frame with the ACE 
encoding sequence. Such purification facilitating domains include, but are not limited 
to, metal chelating peptides - such as histidine-tryptophan modules that allow 
purification on immobilised metals (Poratii J (1992) Protein Expr Purif 3, 263-281), 
protein A domains that allow purification on immobilised immunoglobulin, and the 
domain utilised in the FLAGS extension/affinity purification system (Tmmunex Corp, 
Seattle, WA). The inclusion of a cleavable linker sequence such as Factor XA or 
enterokinase (Invitrogen, San Diego, CA) between the purification domain and ACE is 
useful to facilitate purification. 

Preferably, the ACE construct is pEE-ACEANJ which encodes human tACE that 
lacks the heavily (^-glycosylated, 36-residue N-terminal sequence and is truncated 
after Ser625, thereby lacking most of the juxtamembrane stalk as well as the 
transmembrane and cytoplasmic domains as described in (7a). 




In another preferred embodiment, the ACE construct comprises an underglycosylated 
ACE, constructed by the removal of one or more glycosylation sites — such as one or 
more N-linked glycosylation sites. Glycosylation may be abolished using various 
5 methods known to a person skilled in the art as previously described. Preferably, a 
truncated form of tACE (tACEA36 lacking the first N-terminal 36 residues as well as 
the cytoplasmic domain is used for the construction of mutants (7a) and mutagenic 
oligonucleotides then used for altering the sites that are glycosylated. Preferably, the 
nucleotide sequence of each fragment is confirmed by DNA sequencing to ensure that 
10 only the desired mutation is created. 

Preferably, the ACE construct comprising underglycosylated ACE is pLEN- 
tACEA36g(l, 2, 3, 4) or is pLEN-tACEA3 6g( 1 , 3). 

15 HOST CELLS 

As used herein, the term "host cell" refers to any cell that comprises nucleotide 
sequences that are of use in the present invention, for example, nucleotide sequences 
encoding ACE. 

20 

Host cells may be transformed or transfected with a nucleotide sequence contained in 
a vector e.g. a cloning vector. Preferably, said nucleotide sequence is carried in a 
vector for the replication and/or expression of the nucleotide sequence. The cells will 
be chosen to be compatible with the said vector and may for example be prokaryotic 
25 (for example bacterial), fungal, yeast or plant cells. 

The gram-negative bacterium E. coli is widely used as a host for cloning nucleotide 
sequences. This organism is also widely used for heterologous nucleotide sequence 
expression. However, large amounts of heterologous protein tend to accumulate 
30 inside the cell. Subsequent purification of the desired protein from the bulk of E. coli 
intracellular proteins can sometimes be difficult. 
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In contrast to E. coli, bacteria from the genus Bacillus are very suitable "as 
heterologous hosts because of their capability to secrete proteins into the culture 
medium. Other bacteria suitable as hosts are those from the genera Streptomyces and 
Pseudomonas. 

5 

Depending on the nature of the polynucleotide and/or the desirability for further 
processing of the expressed protein, eukaryotic hosts including yeasts or other fungi 
may be preferred. In general, yeast cells are preferred over fungal cells because yeast 
cells are easier to manipulate. However, some proteins are either poorly secreted 
10 from the yeast cell, or in some cases are not processed properly (e.g. 
hyperglycosylation in yeast). In these instances, a different fungal host organism 
should be selected. 

Examples of expression hosts are fungi - such as Aspergillus species (such as those 
15 described in EP-A-0184438 and EP-A-0284603) and Trichoderma species; bacteria - 
such as Bacillus species (such as those described in EP-A-0 134048 and EP-A- 
0253455), Streptomyces species and Pseudomonas species; yeasts - such as 
Kluyveromyces species (such as those described in EP-A-0096430 and EP-A- 
0301670) and Saccharomyces species; and mammalian cells - such as CHO-K1 cells. 

20 

Hie use of host cells may provide for post-translational modifications (eg. 
glycosylation) as may be needed to confer optimal biological activity on recombinant 
expression products of the present invention. 

25 Aspects of the present invention also relate to host cells comprising the ACE 
constructs of the present invention. The ACE constructs may comprise a nucleotide 
sequence for replication and expression of the sequence. The cells will be chosen to 
be compatible with the vector and may for example be prokaryotic (for example 
bacterial), fungal, yeast or plant cells. 
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In a preferred embodiment, the host cells are mammalian cells - such as CHO-K1 
cells. CHO-K1 cells expressing ACE may be grown and maintained in accordance 
vnihYuetal. (1997). 



NUCLEOTIDE SEQUENCES 



As used herein, the term "nucleotide sequence" refers to nucleotide sequences, 
oligonucleotide sequences, polynucleotide sequences and variants, homologues, 
fragments and derivatives thereof (such as portions thereof) which comprise the 
nucleotide sequences encoding ACE, for example, testis ACE or somatic ACE. 

The nucleotide sequence* may be DNA or RNA of genomic or synthetic or 
recombinant origin, which may be double-stranded or single-stranded whether 
representing the sense or antisense strand or combinations thereof 

Preferably, the term nucleotide sequence is prepared by use of recombinant DNA 
techniques (e.g. recombinant DNA). The nucleotide sequences may include within 
them synthetic or modified nucleotides. A number of different types of modification 
to oligonucleotides are known in the art. These include methylphosphonate and 
phosphorothioate backbones, addition of acridine or polylysine chains at the 3' and/or 
5 f ends of the molecule. For the purposes of the present invention, it is to be 
understood that the nucleotide -sequences described herein may be modified by any 
method available in the art. 

It will be understood by a skilled person that numerous different nucleotide sequences 
can encode the same protein as a result of the degeneracy of the genetic code. In 
addition, it is to be understood that skilled persons may, using routine techniques, make 
nucleotide substitutions that do not substantially affect the activity encoded by the 
nucleotide sequence of the present invention to reflect the codon usage of any particular 
host organism in which the target is to be expressed. Thus, the terms "variant", 
"homologue" or "derivative" in relation to nucleotide sequences include any substitution 
of, variation of, modification of, replacement of, deletion of or addition of one (or more) 
nucleic acids from or to the sequence providing the resultant nucleotide sequence 
encodes a functional protein according to the present invention (or even a modulator of 
ACE according to the present invention if said modulator comprises a nucleotide 
sequence or an amino acid sequence). 
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AMINO ACID SEQUENCES 

As used herein, the term "amino acid sequence" is synonymous with the term 
"polypeptide" and/or the term "protein". In some instances, the term "amino acid 
sequence" is synonymous with the term "peptide".' In some instances, the term 
"amino acid sequence" is synonymous with the term "protein". 

Aspects of the present invention concern the use of amino acid sequences, which may 
be available in databases. These amino acid sequences may comprise ACE proteins. 

The amino acid sequence may be isolated from a suitable source, or it may be made 
synthetically or it may be prepared by use of recombinant DNA techniques. 

Preferably, ACE comprises SEQ ID No. 1 or SEQ ID No. 2, or a mutant, variant, 
homologue, derivative or fragment thereof. More preferably, ACE comprises SEQ ID 
No. 2, or a mutant, variant, homologue, derivative or fragment thereof. 



PURITY 

Preferably the protein solution used for crystallisation is at least 97.5% pure. More 
preferably, the protein solution used for crystallisation is at least 99.0% pure. Most 
preferably, the protein solution used for crystallisation is at least 99.5% pure. 

MODEL 

As used herein, the term "model" refers to a structural model such as a three 
dimensional (3D) structural model (or representation thereof) comprising ACE. 

Test compounds can be modelled that bind spatially and preferentially to ACE - such 
as to bind spatially and preferentially to ACE - for example, the active site of ACE. 

Preferably, the crystal model comprising ACE is built from all or a portion of the 
structure co-ordinates presented in Table A or Table B. 
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MUTANT 



As used herein, the term "mutant" refers to ACE comprising any one or more changes 
in the wild-type ACE sequence shown as SEQ ID No. 1 or one or more changes in the 
5 native ACE sequence shown as SEQ ID No. 2. 

The term "mutant 5 9 is not limited to any of the mutations described herein which are 
reflected in amino acid substitutions of the amino acid residues in ACE, but may also 
include, but are not limited to, other deletions or insertions of nucleotides which may 
10 result in changes in the amino acid residues in the amino acid sequence of ACE. 



In a preferred embodiment, mutations are located at amino acids 337 of SEQ ID No. 2 
or amino acids 90, 109, 155, 337 and 586 of SEQ ID No. 2. 

15 The present invention also enables the solving of the crystal structure of ACE mutants. 
More particularly, by virtue of the present invention, the location of the active site of 
ACE based on its crystal structure permits the identification of desirable sites for 
mutation. For example, one or more mutations may be directed to a particular site - such 
as the active site - or combination of sites of ACE. Similarly, only a location on, at or 

20 near the enzyme surface may be replaced, resulting in an altered surface charge of one or 
more charge units, as compared to the wild-type enzyme. Alternatively, an amino acid 
residue in ACE may be chosen for replacement based on its hydrophilic or hydrophobic 
characteristics. 

25 Such mutants may be characterised by any one of several different properties as 
compared with wild-type ACE. For example, such mutants may have altered surface 
charge of one or more charge units, or have an increased stability to subunit 
dissociation, or an altered substrate specificity in comparison with, or a higher 
specific activity than, wild-type ACE. 

30 * 

The mutants may be prepared in a number of ways that are known by a person skilled 
in the art. For example, mutations may be introduced by means of oligonucleotide- 
directed mutagenesis or other conventional methods. Alternatively, mutants of ACE 
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may be generated by site specific replacement of a particular amino acid with an 
unnaturally occurring amino acid. This may be achieved by growing a host organism 
capable of expressing either the wild-type or mutant polypeptide on a growth medium 
depleted of one or more natural amino acids but enriched in one or more 
5 corresponding unnaturally occurring amino acids. 

The expression, activity (eg. kinetic constants) and/or the crystallisation properties of 
the mutants may be determined using the methods described herein. 

10 VARIANTS/HOMOLOGUES/DERWATIVES/FRAGMENTS 

The ACE described herein is intended to include any polypeptide, which has the 
activity of die naturally occurring ACE and includes all vertebrate and mammalian 
forms. Such terms also include polypeptides that differ from naturally occurring 
15 forms of ACE by having amino acid deletions, substitutions, and additions, but which 
retain the activity of ACE. 

The term 'Variant" is used to mean a naturally occurring polypeptide or nucleotide 
sequences which differs from a wild-type or a native sequence. 

20 

The term "fragment" indicates that a polypeptide or nucleotide sequence comprises a 
fraction of a wild-type or a native sequence. It may comprise one or more large 
contiguous sections of sequence or a plurality of small sections. The sequence may also 
comprise other elements of sequence, for example, it may be a fusion protein with 
25 another protein. Preferably the sequence comprises at least 50%, more' preferably at 
least 65%, more preferably at least 80%, most preferably at least 90% of the wild-type 
sequence. 

The present invention also encompasses the use of variants, homologues and derivatives 
30 of nucleotide and amino acid sequences. Here, the term "homologue" means an entity 
having a certain homology with amino acid sequences or nucleotide sequences. Here, 
the term "homology" can be equated with "identity". 



In the present context, an homologous sequence is taken to include an amino acid 
sequence which may be at least 75, 85 or 90% identical, preferably at least 95 or 98% 
identical to the subject sequence. Although homology can also be considered in terms 
of similarity (i.e. amino acid residues having similar chemical properties/functions), it 
is preferred to express homology in terms of sequence identity. 

An homologous sequence is taken to include a nucleotide sequence which may be at 
least 75, 85 or 90% identical, preferably at least 95 or 98% identical to the subject 
sequence. 

Homology comparisons can be conducted by eye, or more usually, with the aid of 
readily available sequence comparison programs. These commercially available 
computer programs can calculate % homology between two or more sequences. 

% homology may be calculated over contiguous sequences, i.e. one sequence is 
aligned with the other sequence and each amino acid in one sequence is directly 
compared with the corresponding amino acid in the other sequence, one residue at a 
time. This is called an "ungapped" alignment. Typically, such ungapped alignments 
are performed only over a relatively short number of residues. 

Although this is a very simple and consistent method, it fails to take into 
consideration that, for example, in an otherwise identical pair of sequences, one 
insertion or deletion will cause the following amino acid residues to be put out of 
alignment, thus potentially resulting in a large reduction in % homology when a 
global alignment is performed. Consequently, most sequence comparison methods 
are designed to produce optimal alignments that take into consideration possible 
insertions and deletions without penalising unduly the overall homology score. This 
is achieved by inserting "gaps" in the sequence alignment to try to maximise local 
homology. 

However, these more complex methods assign "gap penalties" to each gap that occurs 
in the alignment so that, for the same number of identical amino acids, a sequence 
alignment with as few gaps as possible - reflecting higher relatedness between the two 
compared sequences - will achieve a higher score than one with many gaps. "Affine 
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gap costs" are typically used that charge a relatively high cost for the existence of a 
gap and a smaller penalty for each subsequent residue in the gap. This is the most 
commonly used gap scoring system. High gap penalties will of course produce 
optimised alignments with fewer gaps. Most alignment programs allow the gap 
penalties to be modified. However, it is preferred to use the default values when 
using such software for sequence comparisons. For example when using the GCG 
Wisconsin Bestfit package the default gap penalty for amino acid sequences is -12 for 
a gap and -4 for each extension. 

Calculation of maximum % homology therefore firstly requires the production of an 
optimal alignment, taking into consideration gap penalties. A suitable computer 
program for carrying out such an alignment is the GCG Wisconsin Bestfit package 
(University of Wisconsin, -U.S.A.; Devereux et al, 1984, Nucleic Acids Research 
12:387). Examples of other software than can perform sequence comparisons 
include, but are not limited to, the BLAST package (see Ausubel et al, 1999 ibid - 
Chapter 18), FASTA (Atschul et al, 1990, J. Mol. Biol, 403-410) and the 
GENEWORKS suite of comparison tools. Both BLAST and FASTA are available for 
offline and online searching (see Ausubel et al, 1999 ibid, pages 7-58 to 7-60). 
However, for some applications, it is preferred to use the GCG Bestfit program. A 
new tool, called BLAST 2 Sequences is also available for comparing protein and 
nucleotide sequence (see FEMS Microbiol Lett 1999 174(2): 247-50; FEMS 
Microbiol Lett 1999 177(1): 187-8) 

Although the final % homology can be measured in terms of identity, the alignment 
process itself is typically not based on an all-of-nothing pair comparison. Instead, a 
scaled similarity score matrix is generally used that assigns scores to each pairwise 
comparison based on chemical similarity or evolutionary distance. An example of 
such a matrix commonly used is the BLOSUM62 matrix - the default matrix for the 
BLAST suite of programs. GCG Wisconsin programs generally use either the public 
default values or a custom" symbol comparison table if supplied (see user manual for 
further details). For some applications, it is preferred to use the public default values 
for the GCG package, or in the case of other software, the default matrix, such as 
BLOSUM62. 



Once the software has produced an optimal alignment, it is possible to calculate % 
homology, preferably % sequence identity. The software typically does this as part of 
the sequence comparison and generates a numerical result. 

By way of example, homologous sequences of ACE include, but are not limited to 
human ACE S somatic ACE (accession number: J04144), human ACE T testis ACE 
(accession number: M26657), human ACEH/ACE2 (accession numbers: AAF78220; 
BAB40370; AAF99721), chimp ACE T (accession number: AF193487J2), rabbit 
ACE T mature protein (accession number: P22968), rabbit ACE T full pre-protein 
(accession number: P22968), mouse ACE T testis ACE (accession number: P22967), 
bovine Cdom ACE S C-domain, rat Cdom ACE S C-domain (derived from accession 
number P47820; starting D616), human Ndom ACE S N-domain (derived from 
accession number P12821 (J04144)), chimp Ndom ACE S N-domain (derived from 
accession number AF193487_1), rabbit Ndom ACE S N-domain (derived from 
P 12822), bovine Ndom (Bovine {Bos taunts} ACE S N-domain), mouse Ndom ACE S 
N-domain (derived from accession number P09470), rat Ndom ACE S N-domain 
(derived from accession number P47820), chick ACE (partial ACE accession number 
Q10751), dros AnCE (derived from accession number Q10714), dros ACEr (derived 
from accession number X96913), buffalo fly ACE (derived from accession number 
Q10715), and silkworm ACE (derived from accession number BAA97657), tick ACE 
(derived from accession number U62809). 

The sequences may also have deletions, insertions or substitutions of amino acid 
residues, which produce a silent change and result in a functionally equivalent 
substance. Deliberate amino acid substitutions may be made on the basis of similarity 
in polarity, charge; solubility, hydrophobic^ hydrophilicity, and/or the amphipathic 
nature of the residues as long as the secondary binding activity of the substance is 
retained. For example, negatively charged amino acids include aspartic acid and 
glutamic acid; positively charged amino acids include lysine and arginine; and amino 
acids with uncharged polar head groups having similar hydrophilicity values include 
leucine, isoleucine, valine, glycine, alanine, asparagine, glutamine, serine, threonine, 
phenylalanine, and tyrosine. 



# 
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Conservative substitutions may be made, for example according to the Table below. 
Amino acids in the same block in the second column and preferably in the same line 
in the third column may be substituted for each other: 



ALIPHATIC 


Non-polar 


GAP 






ILV- 




Polar - uncharged 


CSTM 






NQ j 




Polar - charged 


DE 






KR 


AROMATIC 




HFWY 



5 

Homologous substitution (substitution and replacement are both used herein to mean 
the interchange of an existing amino acid residue, with an alternative residue) may 
occur i.e. like-for-like substitution such as basic for basic, acidic for acidic, polar for 
polar etc. Non-homologous substitution may also occur i.e. from one class of residue 
10 to another or alternatively involving the inclusion of unnatural amino acids such as 
ornithine (hereinafter referred to as Z), diaminobutyric acid ornithine (hereinafter 
referred to as B), norleucine ornithine (hereinafter referred to as O), pyriylalanine, 
thienylalanine, naphthyl alanine and phenylglycine. 

15 Replacements may also be made by unnatural amino acids include; alpha* and alpha- 
disubstituted* amino acids, N-aJkyl amino acids*, lactic acid*, halide derivatives of 
natural amino acids, such as trifluorotyrosine*, p-Cl-phenylalanine*, p-Br- 
phenylalanine*, p-I-phenylalanine*, L-allyl-glycine*, B-alanine*, L-a-amino butyric 
acid*, L-y-amino butyric acid*, L-a-amino isobutyric acid*, L-s-amino caproic acid , 7- 

20 amino heptanoic acid*, L-methionine sulfone # *, L-norleucine*, L-norvaline*, p-nitro-L- 
phenylalanine*, L-hydroxyproline*, L-thioproline*, methyl derivatives of phenylalanine 
(Phe) such as 4-methyl-Phe*, pentamethyl-Phe*, L-Phe (4-amino) # , L-Tyr (methyl)*, L- 
Phe (4-isopropyl)*, L-Tic (l,2 5 3,4-tetrahydroisoquinoline-3-carbo>r/l acid)*, L- 
diaminopropionic acid # and L-Phe (4-benzyl)*. The notation * has been utilised for the 

25 purpose of the discussion above (relating to homologous or non-homologous 
substitution), to indicate the hydrophobic nature of the derivative whereas # has been 
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utilised to indicate the hydrophilic nature of the" derivative, #* indicates amphipathic 



The term "derivative" or "derivatised" as used herein includes chemical modification of 



acyl group or an amino group. 

Variant amino acid sequences may include suitable spacer groups that may be inserted 
10 between any two amino acid residues of the sequence including alkyl groups such as 
methyl, ethyl or propyl groups in addition to amino acid spacers such as glycine or p- 
alanine residues. A further form of variation, involves the presence of one or more 
amino acid residues in peptoid form, will be well understood by those skilled in the 
art. For the avoidance of doubt, "the peptoid form" is used to refer to variant amino 
15 acid residues wherein the a-carbon substituent group is on the residue's nitrogen atom 
rather than the a-carbon. Processes for preparing peptides in the peptoid form are 
known in the art, for example Simon RJ et al., PNAS (1992) 89(20), 9367-9371 and 
Horwell DC, Trends Biotechnol. (1995) 13(4), 132-134. 

20 TEST COMPOUNDS 

As used herein, the term "test compound" includes, but is not limited to, a compound 
which may be obtainable from or produced by any suitable source, whether natural or 
not. 



The test compound may be designed or obtained from a library of compounds, which 
may comprise peptides, as well as other compounds, such as small organic molecules 
and particularly new lead compounds. By way of example, the test compound may be 
a natural substance, a biological macromolecule, or an extract made from biological 
30 materials - such as bacteria, fungi, or animal (particularly mammalian) cells or tissues, 
an organic or an inorganic molecule, a synthetic test compound, a semi-synthetic test 
compound, a structural or functional mimetic, a peptide, a peptidomimetics, a 
derivatised test compound, a peptide cleaved from a whole protein, or a peptide 



characteristics. 
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an entity — such as test compound or an ACE modulator. Illustrative of such chemical 
modifications would be replacement of hydrogen by a halo group, an alkyl group, an 
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synthesised synthetically (such as, by way of example, either using a peptide 
synthesiser or by recombinant techniques or combinations thereof, a recombinant test 
compound, a natural or a non-natural test compound, a fusion protein or equivalent 
thereof and mutants, derivatives or combinations thereof. The test compound may 
even be a compound that is a modulator of ACE - such as a known inhibitor of ACE - 
that has been modified in some way eg. by recombinant DNA techniques or chemical 
synthesis techniques. 

* 

Typically, the test compound will be prepared by recombinant DNA techniques 
and/or chemical synthesis techniques. 

Once a test compound capable of interacting ACE has been identified, further steps 
may be carried out to select and/or to modify the test compounds and/or to modify 
existing compounds, such that they are able to modulate ACE. 

MODULATING ACE 

As herein, the term "modulating" refers to preventing, suppressing, inhibiting, 
alleviating, restorating, elevating, increasing or otherwise affecting ACE. 

The term "ACE modulator" may refer to a single entity or a combination of entities. 

The ACE modulator may be an antagonist or an agonist of ACE. 

As used herein, the term "agonist" means any entity, which is capable of interacting 
(eg. binding) with ACE and which is capable of increasing a proportion of the ACE 
that is in an active form, resulting in an increased biological response. 

As used herein, the term "antagonist" means any entity, which is capable of 
interacting (eg. binding) with ACE and which is capable of decreasing (eg. inhibiting) 
a proportion of the ACE that is in an active form, resulting in a decreased biological 
response. 



Preferably, the ACE modulators of the present invention are antagonists of ACE. 



The modulator of ACE may be an organic compound or other chemical. The 
modulator of ACE may be a compound, which is obtainable from or produced by any 
suitable source, whether natural or artificial. The modulator of ACE may be an amino 
acid molecule, a polypeptide, or a chemical derivative thereof, or a combination 
thereof. The modulator of ACE may even be a polynucleotide molecule - which may 
be a sense or an anti-sense molecule. The modulator of ACE may even be an 
antibody. 

The modulator of ACE may be designed or obtained from a library of compounds, 
which may comprise peptides, as well as other compounds, such as small organic 
molecules. 

By way of example, the modulator of ACE may be a natural substance, a biological 
macromolecule, or an extract made from biological materials such as bacteria, fungi, 
or animal (particularly mammalian) cells or tissues, an organic or an inorganic 
molecule, a synthetic agent, a semi-synthetic agent, a structural or functional mimetic, 
a peptide, a peptidomimetic, a derivatised agent, a peptide cleaved from a whole 
protein, or a peptide synthesised synthetically (such as, by way of example, either 
using a peptide synthesiser or by recombinant techniques or combinations thereof, a 
recombinant agent, an antibody, a natural or a non-natural agent, a fusion protein or 
equivalent thereof and mutants, derivatives or combinations thereof). 

Typically, the modulator of ACE will be an organic compound. Typically, the 
organic compounds will comprise two or more hydrocarbyl groups. Here, the term 
"hydrocarbyl group" means a group comprising at least C and H and may optionally 
comprise one or more other suitable substituents. Examples of such substituents may 
include halo-, alkoxy-, nitro-, an alkyl group, a cyclic group etc. In addition to the 
possibility of the substituents being a cyclic group, a combination of substituents may 
form a cyclic group. If the hydrocarbyl group comprises more than one C then those 
carbons need not necessarily be linked to each other. For example, at least two of the 
carbons may be linked via a suitable element or group. Thus, the hydrocarbyl group 
may contain hetero atoms. Suitable hetero atoms will be apparent to those skilled in 
the art and include, for instance, sulphur, nitrogen and oxygen. For some 
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applications, preferably the modulator of ACE comprises at least one cyclic group. 
The cyclic group may be a polycyclic group, such as a non-fused polycyclic group. 
For some applications, the modulator of ACE comprises at least the one of said cyclic 
groups linked to another hydrocarbyl group. 

The modulator of ACE may contain halo groups, for example, fluoro, chloro, bromo or 
iodo groups. 

The modulator of ACE may contain one or more of alkyl, alkoxy, alkenyl, alkylene 
and alkenylene groups - which may be unbranched- or branched-chain. 

The modulator of ACE may be in the form of a pharmaceutically acceptable salt - 
such as an acid addition salt or a base salt - or a solvate thereof, including a hydrate 
thereof. For a review on suitable salts see Berge et a/, (1977) J. Pharm. Sci. 66, 1-19. 

The modulator of ACE may be structurally novel modulators of ACE. 

The modulators of ACE may be analogues of known modulators of ACE - such as 
known inhibitors of ACE (for example, snake venom, peptides produced by 
enzymatic hydrolysis of casein or fish meat protein, or Benazepril, Captopril, 
Cilazapril, Enalapril, Fosinopril, .Lisinopril, Moexipril, Perindopril, Quinapril, 
Ramipril, Trandolapril and Enalaprilat. 

Preferably, the ACE modulators have improved properties over those previously 
available, for example, fewer side effects - such as cough (eg. dry, persistent); fever and 
chills; hoarseness; swelling of face, mouth, hands, or feet; trouble in swallowing or 
breathing; itching of skin; yellow eyes or skin; dizziness, light-headedness, or fainting; 
skin rash, with or without itching; fever, or joint pain; abdominal pain, abdominal 
distention; nausea, or vomiting; chest pain, confusion; irregular heartbeat; nervousness; 
numbness or tingling in hands, feet, or lips; weakness or heaviness of legs; headache, 
diarrhoea; loss of taste; nausea; unusual tiredness. 



The modulator of ACE may be a mimetic. 
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The modulator of ACE may also be chemically modified". 

The modulator of ACE may be capable of displaying other therapeutic properties. 

5 The modulator of ACE may be used in combination with one or more other 
pharmaceutically active agents. 

If combinations of active agents are administered, then they may be administered 
simultaneously, separately or sequentially. 



MIMETIC 



As used herein, the term "mimetic" relates to any chemical which includes, but is not 
limited to, a peptide, polypeptide, antibody or other organic chemical which has the 
15 same qualitative activity or effect as a known compound. That is, the mimetic is a 
functional equivalent of a known compound. 

STEREO AND GEOMETRIC ISOMERS 



20 Modulators of ACE may exist as stereoisomers and/or geometric isomers — e.g. they 
may possess one or more asymmetric and/or geometric centres and so may exist in 
two or more stereoisomeric and/or geometric forms. The present invention 
contemplates the use of all the individual stereoisomers and geometric isomers, and 
mixtures thereof. 

25 

PHARMACEUTICAL SALT ' 

Modulators of ACE may be administered in the form of a pharmaceutically acceptable 
salt. 

30 

PharmaceuticaUy-acceptable salts are well known to those skilled in the art, and for 
example include those mentioned by Berge et al 9 (1977) X Pharm. Set, 66, 1-19. 
Suitable acid addition salts are formed from acids which form non-toxic salts and 
include the hydrochloride, hydrobromide, hydroiodide, nitrate, sulphate, bisulphate, 



# 



44 



10 



15 



phosphate, hydro genphosphate, acetate, trifluoroacetate, gluconate, lactate, salicylate, 
citrate, tartrate, ascorbate, succinate, maleate, fumarate, gluconate, formate, benzoate, 
methanesulphonate, ethanesulphonate, benzenesulphonate and p-toluenesulphonate 
salts. 

When one or more acidic moieties are present, suitable pharmaceutical^ acceptable 
base addition salts can be formed from bases which form non-toxic salts and include 
Hie aluminium, calcium, lithium, - magnesium, potassium, sodium, zinc, and 
pharmaceutically-active amines such as diethanolamine, salts. 

A pharmaceutical^ acceptable salt of a modulator of ACE may be readily prepared 
by mixing together solutions of the modulator of ACE and the desired acid or base, as 
appropriate. The salt may precipitate from solution and be collected by filtration or 
may be recovered by evaporation of the solvent. 

The modulator of ACE may exist in polymorphic form. 



The modulator of ACE may contain one or more asymmetric carbon atoms and 
therefore exists in two or more stereoisomer^ forms. Where a modulator of ACE 
20 contains an alkenyl or alkenylene group, cis (E) and trans (Z) isomerism may also 
occur. The present invention includes the individual stereoisomers of the modulator 
of ACE and, where appropriate, the individual tautomeric forms thereof, together with 
mixtures thereof. 

25 Separation of diastereoisomers or cis and trans isomers " may be achieved by 
conventional techniques, e.g. by fractional crystallisation, chromatography or 
H.P.L.C. of a stereoisomer^ mixture of the modulator of ACE or a suitable salt or 
derivative thereof. An individual enantiomer of the modulator of ACE may also be 
prepared from a corresponding optically pure intermediate or by resolution, such as 

30 by H.P.L.C. of the corresponding racemate using a suitable chiral support or by 
fractional crystallisation of the diastereoisomeric salts formed by reaction of the 
corresponding racemate with a suitable optically active acid or base, as appropriate. 



The modulator of ACE may also include all suitable isotopic variations of the 
modulator of ACE or a pharmaceutical^ acceptable salt thereof. An isotopic 
variation of an modulator of ACE or a pharmaceutically acceptable salt thereof is 
defined as one in which at least one atom is replaced by an atom having the same 
atomic number but an atomic mass different from the atomic mass usually found in 
nature. Examples of isotopes that can be incorporated into the modulator of ACE and 
pharmaceutically acceptable salts thereof include isotopes of hydrogen, carbon, 
nitrogen, oxygen, phosphorus, sulphur, fluorine and chlorine such as 2 H, 3 H, 13 C, k 14 C, 
15 N, 17 0, ls O, 31 P, 32 P, 35 S, 18 F and 36 C1, respectively. Certain isotopic variations of 
the modulator of ACE and pharmaceutically acceptable salts thereof, for example, 
those in which a radioactive isotope such as 3 H or 14 C is incorporated, are useful in 
drug and/or substrate tissue distribution studies. Tritiated, i.e., 3 H, and carbon-14, i.e., 
14 C, isotopes are particularly preferred for their ease of preparation and detectability. 
Further, substitution with isotopes such as deuterium, i.e., 2 H, may afford certain 
therapeutic advantages resulting from greater metabolic stability, for example, 
increased in vivo half-life or reduced dosage requirements and hence may be preferred 
in some circumstances. Isotopic variations of the modulator of ACE and 
pharmaceutically acceptable salts thereof of this invention can generally be prepared 
by conventional procedures using appropriate isotopic variations of suitable reagents. 

It will be. appreciated by those skilled in the art that the agent may be derived from a 
prodrug. Examples of prodrugs include entities that have certain protected group(s) 
and which may not possess pharmacological activity as such, but may, in certain 
instances, be administered (such as orally or parenterally) and thereafter metabolised 
in the body to form the modulator of ACE which is pharmacologically active. 

It will be further appreciated that certain moieties known as "pro-moieties", for 
example as described in "Design of Prodrugs" by H. Bundgaaxd, Elsevier, 1985 (the 
disclosured of which is hereby incorporated by reference), may be placed on 
appropriate functionalities of the modulator of ACE. Such prodrugs are also included 
within the scope of the invention. 



PHARMACEUTICALLY ACTIVE SALT 

The modulator of ACE may be administered as a pharmaceutically acceptable salt. 
Typically, a pharmaceutically acceptable salt may be readily prepared by using a 
5 desired acid or base, as appropriate. The salt may precipitate from solution and be 
collected by filtration or may be recovered by evaporation of the solvent. 

CHEMICAL SYNTHESIS METHODS 

10 The modulator of ACE of the present invention may be prepared by chemical 
synthesis techniques. 

It will be apparent to those skilled in the art that sensitive functional groups may need to 
be protected and deprotected during synthesis of a compound of the invention. This may 
1 5 be achieved by conventional techniques, for example as described in "Protective Groups 
in Organic Synthesis" by T W Greene and PGM Wuts, John Wiley and Sons Inc. 
(1991), and by P.J.Kociensld, in "Protecting Groups", Georg Thieme Verlag (1994). 

It is possible during some of the reactions that any stereocentres present could, under 
20 certain conditions, be racemised, for example if a base is used in a reaction with a 
substrate having an having an optical centre comprising a base-sensitive group. This is 
possible during e.g. a guanylation step. It should be possible to circumvent potential 
problems such as this by choice of reaction sequence, conditions, reagents, 
protection/deprotection regimes, etc. as is well-known in the art. 

25 

The compounds and salts may be separated and purified by conventional methods. 

Separation of diastereomers may be achieved by conventional techniques, e.g. by 
fractional crystallisation, chromatography or H.P.L.C. of a stereoisomeric mixture of 
30 a compound of formula (I) or a suitable salt or derivative thereof. An individual 
enantiomer of a compound of formula (!) may also be prepared from a corresponding 
optically pure intermediate or by resolution, such as by H.P.L.C. of the corresponding 
racemate using a suitable chiral support or by fractional crystallisation of the 
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diastereomeric salts formed by reaction of the corresponding" racemate with a suitably 



ACE, modulators of ACE or variants, homologues, derivatives, fragments or mimetics 
5 thereof may be produced using chemical methods to synthesise the ACE or the 
modulator of ACE in whole or in part. For example, a ACE peptide or a modulator of 
ACE that is a peptide can be synthesised by solid phase techniques, cleaved from the 
resin, and purified by preparative high performance liquid chromatography (e.g., 
Creighton (1983) Proteins Structures And Molecular Principles, WH Freeman and Co, 
10 New York NY). The composition of the synthetic peptides may be confirmed by 
amino acid analysis or sequencing (e.g., the Edman degradation procedure; Creighton, 
supra). 

Synthesis of peptides (or variants, homologues, derivatives, fragments or mimetics 
15 thereof) may be performed using various solid-phase techniques (Roberge JY et al 
(1995) Science 269: 202-204) and automated synthesis may be achieved, for example, 
using the ABI 43 1 A Peptide Synthesizer (Perkin Elmer) in accordance with the 
instructions provided by the manufacturer. Additionally, the amino acid sequences 
comprising the modulator of ACE, may be altered during direct synthesis and/or 
20 combined using chemical methods with a sequence from other subunits, or any part 
thereof, to produce a variant modulator of ACE. 

CHEMICAL MODIFICATION 

25 The modulator of ACE may be a chemically modified modulator of ACE. 

The chemical modification of a modulator of ACE may either enhance or reduce 
interactions between the modulator of ACE and the target — such as hydrogen bonding 
interactions, charge interactions, hydrophobic interactions, van der Waals interactions 
30 or dipole interactions. 



optically active acid or base. 



In one aspect, the modulator of ACE may act as a model (for example, a template) for 
the development of other compounds. 




PHARMACEUTICAL COMPOSITIONS 

The components may be administered alone but will generally be administered as a 
5 pharmaceutical composition - e.g. when the components are in a mixture with a 
suitable pharmaceutical excipient, diluent or carrier selected with regard to the 
intended route of administration and standard pharmaceutical practice. 

For example, the components can be administered in the form of tablets, capsules, 
10 ovules, elixirs, solutions or suspensions, which may contain flavouring or colouring 
agents, for imm ediate-, delayed-, modified-, sustained-, pulsed- or controlled-release 
applications. 

If the pharmaceutical is a tablet, then the tablet may contain excipients such as 
15 macrocrystalline cellulose, lactose, sodium citrate, calcium carbonate, dibasic calcium 
phosphate and glycine, disintegrants such as starch (preferably com, potato or tapioca 
starch), sodium starch glycollate, croscarmellose sodium and certain complex 
silicates, and granulation binders - such as polyvinylpyrrolidone, 
hydroxypropylmethylcellulose (HPMC), hydroxypropylcellulose (HPC), sucrose, 
20 gelatin and acacia. Additionally, lubricating agents - such as magnesium stearate, 
stearic acid, glyceryl behenate and talc may be included. 

Solid compositions of a similar type may also be employed as fillers in gelatin 
capsules. Preferred excipients in this regard include lactose, starch, a cellulose, milk 
25 sugar or high molecular weight polyethylene glycols. For aqueous suspensions and/or 
elixirs, the modulator of ACE may be combined with various sweetening or 
flavouring agents, colouring matter or dyes, with emulsifying and/or suspending 
agents and with diluents such as water, ethanol, propylene glycol and glycerin, and 
combinations thereof. 

30 

The routes for administration (delivery) may include, but are not limited to, one or 
more of oral (e.g. as a tablet, capsule, or as an ingestable solution), topical, mucosal 
(e.g. as a nasal spray or aerosol for inhalation), nasal, parenteral (e.g. by an injectable 
form), gastrointestinal, intraspinal, intraperitoneal, intramuscular, intravenous, 



intraventricular, intrauterine, intraocular, intradermal, intracranial, intratracheal, 
intravaginal, intracerebroventricular, intracerebral, subcutaneous, ophthalmic 
(including intravitreal or intracameral), transdermal, rectal, buccal, vaginal, epidural, 
sublingual. 

Pharmaceutical compositions of the present invention may comprise a therapeutically 
effective amount of ACE, one or more modulators of ACE or combinations thereof. 

The pharmaceutical compositions may be for human or animal usage in human and 
veterinary medicine and will typically comprise any one or more of a 
pharmaceutically acceptable diluent, carrier, or excipient. Acceptable carriers or 
diluents for therapeutic use are well known in the pharmaceutical art, and are 
described, for example, in Remington's Pharmaceutical Sciences, Mack Publishing 
Co. (A. R. Gennaro edit. 1985). The choice of pharmaceutical carrier, excipient or 
diluent can be selected with regard to the intended route of administration and 
standard pharmaceutical practice. The pharmaceutical compositions may comprise as 
- or in addition to - the carrier, excipient or diluent any suitable binder(s), 
lubricant(s), suspending agent(s), coating agent(s), solubilising agent(s). 

Preservatives, stabilizers, dyes and even flavouring agents may be provided in the 
pharmaceutical composition. Examples of preservatives include sodium benzoate, 
sorbic acid and esters of p-hydroxybenzoic acid. Antioxidants and suspending agents 
may be also used. 

There may be different composition/formulation requirements dependent on the 
different delivery systems. By way of example, the pharmaceutical composition of 
the present invention may be formulated to be administered using a mini-pump or by 
a mucosal route, for example, as a nasal spray or aerosol for inhalation or ingestable 
solution, or parenterally in which the composition is formulated by an injectable form, 
for delivery, by, for example, an intravenous, intramuscular or subcutaneous route. 
Alternatively, the formulation may be designed to be administered by a number of 
routes. 
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If the modulator of ACE is to be administered mucosally through the gastrointestinal 
mucosa, it should be able to remain stable during transit though the gastrointestinal 
tract; for example, it should be resistant to proteolytic degradation, stable at acid pH 
and resistant to the detergent effects of bile. 

Where appropriate, the pharmaceutical compositions may be administered by 
inhalation, in the form of a suppository or pessary, topically in the form of a lotion, 
solution, cream, ointment or dusting powder, by use of a skin patch, orally in the form 
of tablets containing excipients such as starch or lactose, or in capsules or ovules 
) either alone or in admixture with excipients, or in the form of elixirs, solutions or 
suspensions containing flavouring or colouring agents, or the pharmaceutical 
compositions can be injected parenterally, for example intravenously, intramuscularly 
or subcutaneously. For parenteral administration, the compositions may be best used 
in the form of a sterile aqueous solution which may contain other substances, for 
5 example enough salts or monosaccharides to make the solution isotonic with blood. 
For buccal or sublingual administration the compositions may be administered in the 
form of tablets or lozenges which can be formulated in a conventional manner. 

The modulators of ACE may be used in combination with a cyclodextrin. 
0 Cyclodextrins are known to form inclusion and non-inclusion complexes with drug 
molecules. Formation of a drug-cyclodextrin complex may modify the solubility, 
dissolution rate, bioavailability and/or stability property of a drug molecule. Drug- 
cyclodexlrin complexes are generally useful for most dosage forms and administration 
routes. As an alternative to direct complexation with the drug the cyclodextrin may 
25 be used as an auxiliary additive, e.g. as a carrier, diluent or solubiliser. Alpha-, beta- 
and gamma-cyclodextrins are most commonly used and suitable examples are 
described in WO-A-91/11172, WO-A-94/02518 and WO-A-98/55148. 

If the modulator of ACE is a protein, then said protein may be prepared in situ in the 
30 subject being treated. In this respect, nucleotide sequences encoding said protein may 
be delivered by use of non-viral techniques (e.g. by use of liposomes) and/or viral 
techniques (e.g. by use of retroviral vectors) such that the said protein is expressed 
from said nucleotide sequence. 




DOSE LEVELS 

Typically, a physician will determine the actual dosage which will be most suitable 
5 for an individual subject. The specific dose level and frequency of dosage for any 
particular patient may be varied and will depend upon a variety of factors including 
the activity of the specific compound employed, the metabolic stability and length of 
action of that compound, the age, body weight, general health, sex, diet, mode and 
time of administration, rate of excretion, drug combination, the severity of the 
10 particular condition, and the individual undergoing therapy. 

FORMULATION 

The components) may be formulated into a pharmaceutical composition, such as by 
15 mixing with one or more of a suitable carrier, diluent or excipient, by using 
techniques that are known in the art. 

VECTOR 

20 Aspects of the present invention relate to a vector comprising a nucleotide sequence - 
such as a nucleotide sequence encoding ACE or a modulator of ACE - administered to 
a subject. 

Preferably, ACE or the modulator of ACE is prepared and/or delivered using a genetic 
25 vector. 

As it is well known in the art, a vector is a tool that allows or facilitates the transfer of 
an entity from one environment to another. In accordance with the present invention, 
and by way of example, some vectors used in recombinant DNA techniques allow 
30 entities, such as a segment of DNA (such as a heterologous DNA segment, such .as a 
heterologous cDNA segment), to be transferred into a host and/or a target cell for the 
purpose of replicating the vectors comprising nucleotide sequences and/or expressing 
the proteins encoded by the nucleotide sequences. Examples of vectors used in 
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recombinant DNA techniques include, but are not limited to, plasmids, chromosomes, 
artificial chromosomes or viruses. 

The term "vector" includes expression vectors and/or transformation vectors. 

The term "expression vector" means a construct capable of in vivo or in vitro/ex vivo 
expression. 

The term "transformation vector" means a construct capable of being transferred from 
one species to another. 

REGULATORY SEQUENCES 

ha some applications, nucleotide sequences are operably linked to a regulatory 
sequence which is capable of providing for the expression of the nucleotide sequence, 
such as by a chosen host cell. By way of example, a vector comprising the ACE 
nucleotide sequence is operably linked to such a regulatory sequence i.e. the vector is 
an expression vector. 

The term "operably linked" refers to a juxtaposition wherein the components 
described are in a relationship permitting them to function in their intended manner. 
A regulatory sequence "operably linked" to a coding sequence is ligated in such a way 
that expression of the coding sequence is achieved under conditions compatible with 
the control sequences. 

The term "regulatory sequences" includes promoters and enhancers and other 
expression regulation signals. 

The term "promoter" is used in the normal sense of the art, e.g. an RNA polymerase 
binding site. 



Enhanced expression of a nucleotide sequence, for example, a nucleotide sequence 
encoding ACE - may also be achieved by the selection of heterologous regulatory 
regions, e.g. promoter, secretion leader and terminator regions, which serve to 
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increase expression and, if desired, secretion levels of the protein of interest from the 
chosen expression host and/or to provide for the inducible control of the expression of 



ACE. In eukaryotes, polyadenylation sequences may be operably connected to the 
ACE nucleotide sequence. 

5 

Preferably, the ACE nucleotide sequence is operably linked to at least a promoter. 

Aside from the promoter native to the gene encoding the ACE nucleotide sequence, 
other promoters may be used to direct expression of the ACE polypeptide. The 
10 promoter may be selected for its efficiency in directing the expression of the ACE 
nucleotide sequence in the desired expression host. 

In another embodiment, a constitutive promoter may be selected to direct , the 
expression of the ACE nucleotide sequence of the present invention. Such an 
15 expression construct may provide additional advantages since it circumvents the need 
to culture the expression hosts on a medium containing an inducing substrate. 

Hybrid promoters may also be used to improve inducible regulation of the expression 
construct. 



Hie promoter can additionally include features to ensure or to increase expression in a 
suitable host. For example, the features can be conserved regions such as a Pribnow 
Box or a TATA box. The promoter may even contain other sequences to affect (such 
as to maintain, enhance, decrease) the levels of expression of the ACE nucleotide 
25 sequence. For example, suitable other sequences include the Shl-intron or an ADH 
intron. Other sequences include inducible elements - such as temperature, chemical, 
light or stress inducible elements. Also, suitable elements to enhance transcription or 
translation may be present. 

30 EXPRESSION VECTOR 



20 



Preferably, nucleotide sequences - such as nucleotide sequences encoding ACE or 
modulators of ACE - are inserted into a vector that is operably linked to a control 
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sequence that is capable of providing for the expression of the coding sequence by the 
host cell. 

Nucleotide sequences produced by a host recombinant cell may be secreted or may be 
contained intracellularly depending on the sequence and/or the vector used. As will 
be understood by those of skill in the art, expression vectors can be designed with 
signal sequences, which direct secretion of the nucleotide sequence through a 
particular prokaryotic or eukaryotic cell membrane. 

The expression vector, may be pEE-tACEA3 6N J which encodes human tACE that 
lacks the heavily ^-glycosylated, 36-residue N-terminal sequence and is truncated after 
Ser 625 , thereby lacking most of the juxtamembrane stalk as well as the transmembrane 
and cytoplasmic domains. 

In a preferred aspect, the expression vector is pLEN- tACEA36g(n), where n is a set of 
numbers defining the available //-linked glycosylation sites. 

Preferably, the expression vectors are stably expressed in CHO cells as described 
previously (Eblers et ah (1996) Biochemistry 35, 9549-9559). More preferably, the 
expression vectors are pLEN- tACEA36g(l, 2, 3, 4) and pLEN- tACEA36g(l,3). 

FUSION PROTEINS 

ACE or a modulator of ACE may be expressed as a fusion protein to aid extraction 
and purification and/or delivery of the modulator of ACE or the ACE protein to an 
individual and/or to facilitate the development of a screen for modulators of ACE. 

Examples of fusion protein partners include glutathione-S-transferase (GST), 6xHis, 
GAL4 (DNA binding and/or transcriptional activation domains) and (3-galactosidase. 



It may also be convenient to include a proteolytic cleavage site between the fusion 
protein partner and the protein sequence of interest to allow removal of fusion protein 



sequences. Preferably, the fusion protein will not hinder the activity of the protein of 
interest 

The fusion protein may comprise an antigen or an antigenic determinant fused to the 
substance of the present invention. In this embodiment, the fusion protein may be a 
non-naturally occurring fusion protein comprising a substance, which may act as an 
adjuvant in the sense of providing a generalised stimulation of the immune system. 
The antigen or antigenic determinant may be attached to either the -amino or carboxy 
terminus of the substance. 

ORGANISM 

The term "organism" in relation to the present invention includes any organism that 
could comprise ACE and/or modulators of ACE. Examples of organisms may include 
mammals, fungi, yeast or plants. 

Preferably, the organism is a mammal. More preferably, the organism is a human. 
TRANSFORMATION 

As indicated earlier, the host organism can be a prokaryotic or a eukaryotic organism. 
Examples of suitable prokaryotic hosts include E. coli and Bacillus subtilis. Teachings 
on the transformation of prokaryotic hosts are well documented in the art, for example 
see Sambrook et al (Molecular Cloning: A Laboratory Manual, 2nd edition, 1989, Cold 
Spring Harbor Laboratory Press) and Ausubel et al, Current Protocols in Molecular 
Biology (1995), John Wiley & Sons, Inc. Examples of suitable eukaryotic hosts 
include mammalian cells. 

If a prokaryotic host is used then the nucleotide sequence - such as the ACE nucleotide 
sequence - may need to be suitably modified before transformation - such as by removal 
ofintrons. 

Thus, the present invention also relates to the transformation of a host cell with a 
nucleotide sequence - such as ACE or a modulator of ACE. Host cells transformed 
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with the nucleotide sequence may be cultured under conditions suitable for the 
expression and recovery of the encoded protein from cell culture. The protein 
produced by a recombinant cell may be secreted or may be contained intracellularly 
depending on the sequence and/or the vector used. As will be understood by those of 
skill in the art, expression vectors containing coding sequences can be designed with 
signal sequences which direct secretion of the coding sequences through a particular 
prokaryotic or eukaryotic cell membrane. Other recombinant constructions may join 
the coding sequence to nucleotide sequence encoding a polypeptide domain, which 
will facilitate purification of soluble proteins (Kroll DJ et al (1993) DNA Cell Biol 
12:441-53) e.g. 6-His or GlutatMone-S-transferase. 



TRANSFECTION 

Vectors comprising for example, the ACE nucleotide sequence, may be introduced 
into host cells, for example, mammalian cells, using a variety of methods. 

Typical transfection methods include electroporation, DNA biolistics, lipid-mediated 
transfection, compacted DNA-mediated transfection, liposomes, immunoliposomes, 
lipofectin, cationic agent-mediated, cationic facial amphiphiles (CFAs) {Nature 
Biotech. (1996) 14, 556), multivalent cations such as spermine, cationic lipids or 
polylysine, 1, 2,-bis (oleoyloxy)-3-(ti-imethylammonio) propane (DOTAP)-cholesterol 
complexes (Wolff and Trubetskoy 1998 Nature Biotechnology 16: 421) and 
combinations thereof. 

Uptake of nucleic acid constructs by mammalian cells is enhanced by several known 
transfection techniques for example those including the use of transfection agents. 
Example of these agents include cationic agents (for example calcium phosphate and 
DEAE-dextxan) and lipofectants (for example lipofectam™ and transfectam™). 
Typically, nucleic acid constructs are mixed with the transfection agent to produce a 
composition. 

Such methods are described in many standard laboratory manuals - such as Sambrook 
et al, Molecular Cloning: A Laboratory Manual, 2d ed. (1989) Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N. Y. 




GENERAL RECOMBINANT DNA METHODOLOGY TECHNIQUES 

The present invention employs, unless otherwise indicated, conventional techniques of 
5 chemistry, molecular biology, microbiology, recombinant DNA and immunology, which 
are within the capabilities of a person of ordinary skill in the art. Such techniques are 
explained in the literature. See, for example, J. Sambrook, E. F. Fritsch, and T. Maniatis, 
1989, Molecular Cloning: A Laboratory Manual, Second Edition, Books 1-3, Cold 
Spring Harbor Laboratory Press; Ausubel, F. M. et al. (1995 and periodic supplements; 

10 Current Protocols in Molecular Biology, ch. 9, 13, and 16, John Wiley & Sons, New 
York, N.Y.); B. Roe, J. Crabtree, and A. Kahn, 1996, DNA Isolation and Sequencing: 
Essential Techniques, John Wiley & Sons; J. M. Polak and James O'D. McGee, 1990, In 
Situ Hybridization: Principles cmd Practice; Oxford University Press; M. J. Gait 
(Editor), 1984, Oligonucleotide Synthesis: A Practical Approach, M Press; and, D. M. J. 

15 Lilley and J. E. Dahlberg, 1992, Methods of Enzymology: DNA Structure Part A: 
Synthesis and Physical Analysis of DNA Methods in Enzymology, Academic Press. Each 
of these general texts is herein incorporated by reference. 

DESCRIPTION OF THE DRAWINGS 

20 

Figure 1 is a schematic presentation of wild-type tACE (tACE) and 1ACEA36 
glycosylation mutants. Glycosylation sites are shown with boxes, glycosylated 
always (black box), glycosylated partially (grey box) and unglycosylated (open box). 
The O-glycosylated region at the N-terminus of (tACE), which is absent in tACEA36, 
25 is shown with a stippled box. TM is the transmembrane domain. 

Figure 2 illustrates the expression of tACE glycosylation mutants. Proteins are 
immunodetected from detergent-solubilised cells (A) and from harvested medium (B) 
with rabbit anti-human tACE antibody (at 1:2000). The estimated protein size is 
30 indicated. Lanes contain tACE A3 6 (A3 6), untreated CHO cells (CHO) and ACE 
glycosylation mutants tACEA36-g(n) where n equals the number of sites that are 
glycosylated. 
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Figures 3 illustrates the effect of tunicamycin on the expression and processing of 
tACE. CHO cells expressing wild-type tACE are treated with the glycosylate 
inhibitor tunicamycin for up to 24 h. Cell lysate from untreated (a) and treated (b) 
cells are analysed at the indicated times by Western blotting, protein molecular 
5 weights are given. 

Figure 4 illustrates the effect of phorbol ester on the levels of ACE activity. Results 
are expressed as percentage soluble of total (soluble plus cell associated) ACE 
activity. Lanes show wild-type tACE (tACE), tACEA36 (A36) and glycosylate 
10 mutants 1ACEA36 g(n), where n gives the number of sites that are glycosylated. CHO 
cells were grown either in the absence (open bars) or the presence (filled bars) of 
phorbol 12, 13 -dibutyrate. 

Figure 5 represents the orthorhombic crystals of tACEA36. 

15 

Figure 6 schematically illustrates a general purpose computer (132) of the type that may 
be used in accordance with the present invention. The computer (132) includes a central 
processing unit (134), a read only memory (136), a random access memory (138), a hard 
disk drive (140), a display driver (142) and display (144) and a user input/output circuit 

20 (146) with a keyboard (148) and mouse (150) all connected via a common bus (152). 
The central processing unit (134) may execute program instructions stored within the 
ROM (136), the RAM (138) or the hard disk drive (140) to carry out processing of signal 
values that may be stored within the RAM (138) or the hard disk drive (140). The 
program may be written in a wide variety of different programming languages. The 

25 computer program itself may be stored and distributed on a recording medium, such as a 
compact disc, or may be downloaded over a network link (not illustrated). The general 
purpose computer (132) when operating under control of an appropriate computer 
program effectively forms an apparatus for performing aspects of the present invention. 

30 Figure 7 illustrates the overview of the testis ACE structure, (a) Ribbon diagram of 
the molecule looking down on the active site. The molecule can be divided into 2 
halves as domains I and E shown in cyan and pink respectively. The active site zinc 
ion and the lisinopril molecule are shown in yellow. The two chloride ions are shown 




as black spheres, (b) The molecular surface representation showing the active site 
groove, (c) The structure-sequence, relationship in ACE. The secondary structure 
elements (domain I-cyan; domain II-pink) follow the same colour code as in (a). The 
important residues which are involved in binding are marked- zinc ligands (yellow), 
5 chloride binding residues (Cll- light green, C12- dark green), lisinopril binding 
residues (cyan) and glycosylation sites (open boxes). 

Figure 8 illustrates details of the active site, (a) Zinc co-ordination in the native 
structure. Bound zinc ion, acetate ion (AC) and part of the butyl succinate moiety 
10 (LIG) are shown, (b) Details of lisinopril (LIS) interactions. Bound zinc ion is also 
shown. 

Figure 9 illustrates the comparison of (a) ACE and (b) Neurolysin folds. The same 
view as in Figure 7a has been retained. 

15 

The invention will now be further described by way of Examples, which are meant to 
serve to assist one of ordinary skill in the art in carrying out the invention and are not 
intended in any way to limit the scope of the invention. 

20 EXAMPLES 

Example 1 

Materials & Methods 

25 

Materials 

Endoproteinase Lys-C and Asp-N, peptide-iV-glycosidase F (PNGase F), 
endoglycosidase H (endo H), neuraminidase, and O-glycosidase are purchased from 
Roche Biochemicals. Cyanogen bromide, trifluoroacetic acid, and calibration 
30 standards (angiotensin, insulin, myoglobin, oxidised insulin B-chain, and TPCK- 
treated trypsin) are from Sigma Chemical Co. Glycosylation inhibitor N- 
butyldeoxynojirimycin (NB-DNJ) was a kind gift from Dr. F. Piatt, University of 
Oxford, UK. 
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Comtruction of Expression Vectors 

pEE-ACEA36NJ encodes human tACE that lacks the heavily O-glycosylated, 36- 
residue N-tenninal sequence and is truncated after Ser 625 , thereby lacking most of the 
juxtamembrane stalk as well as the transmembrane and cytoplasmic domains, and is 

5 constructed as follows. The 5' half of the ACE cDNA in the plasmid pLEN-ACE- 
JMA24 is excised by digestion with BamHI and Nhel and replaced with the similarly 
digested fragment from plasmid pLEN-ACEA36N. pLEN-ACE-JMA24 has an 
•engineered EcoRI site at nucleotide (nt) 1984 in the ACE cDNA. The sequence 
between nt 1854 (the start of the unique Bell site) and nt 1990 (the end of the codon 

10 for Ser 625 ) in the native ACE cDNA are amplified by the polymerase chain reaction, 

625 

using a 3' primer that contains two stop codons (TAA and TAG) after the Ser 
codon, followed by an EcoRI site. The recombinant sequence is inserted into the 
pLEN-ACE_36N/JMA24 hybrid cut with Bell and EcoRI, to generate pLEN- 
ACEA36NJ. The ACEA36NJ coding sequence is excised by digestion of unique Xbal 
15 (generated after first subcloning in pBluescript) and EcoRI sites and inserted into the 
polylinker of the expression vector pEE14 ? to generate pEE-ACEA36NJ. 



Cell Culture and Transfections 

CHO-K1 cells stably transfected with pLEN-ACE glycosylation mutants are grown 
20 and maintained in standard media (50% Ham's F-12/50% DME medium 
supplemented with 20 mM Hepes, pH- 7.3) containing 2% fetal bovine scrum (heated 
to 65 n C for 1 5 mins before ul S) and 40 |xM Zn Cl 2 • In addition, native CHOK1 cells 
are cotransfected with pEE-ACEA36NJ (10 \ig) and pSV2NEO (1 |xg) by the calcium 
phosphate precipitate method and clones stably resistant to G418 (Geneticin, Gibco- 
25 BRL) are selected and assayed for ACE activity, by procedures detailed previously 
(9,1 1). Clones stably expressing pEE-ACEA36NJ are further selected for resistance to 
methionine sulfoximine and then amplified, as described (Davis, SJ, Davies, EA, 
Barclay, AN, Daenke, S, Bodian, DL, Jones, EY, Stuart, DI, Butters, TD, Dwek, RA, 
and van der Merwe, PA (1995) J Biol Chem 270, 369-375). Methionine sulfoximine- 
30 amplified cells are grown first in GMEM-10 (Gibco-BRL) containing 10% dialyzed 
fetal bovine serum (FBS) (Gibco-BRL) and 1.5 mM NB-DNJ for 3 days and then 
refed with GMEM-10, 5% dialyzed FBS, 2 mM NB-DNJ. This medium is changed 
twice over a period of 9 days before harvesting. 




Enzyme Purification 

Soluble, recombinant tACE (wild-type, ACEA36NJ and ACE glycosylation mutants), 
is purified from conditioned media by affinity chromatography on a Sepharose-28- 
lisinopril affinity resin. The protein is quantitated by amino acid analysis and assayed 
for activity using the substrate hippuryl-L-histidyl-L-leucine , as described (Ehlers, 
MRE, Chen, Y-N, Riordan, JF (1991) ProcNatl Acad SciUSA 88, 1009-1013). 

Deglycosylation of A CE 

tACEA36NJ (12.5 nmol) purified from cultures treated with NB-DNJ is digested with 
endo H (30 mU) in 100 mM sodium phosphate, 0.1 mM ZnCl 2 , 1% BSA, pH 6.0 for 
16 h at 37°C. The endo H-treated ACE is passed through a lectin affinity column 
consisting of equal parts of concanavalin A, wheat germ, and lentil lectin, after 
equilibration with 20 mM Tris-HCl, 0.5 M NaCl at pH 7.4. The deglycosylated ACE 
is collected in the flowthrough. Free oligosaccharides and any other impurities are 
removed from the flowthrough fraction by a final lisinopril-Sepharose affinity 
chromatography step. The homogeneity of the tACEA36NJ after deglycosylation is 
confirmed by SDS-PAGE on a 4-20% acrylamide gel and MALDI-TOF mass 
spectrometry. 

Construction of Glycosylation Mutants 

Minimally glycosylated isoforms -of human testis ACE are constructed by the removal 
of a combination of N-linked glycosylation sites. Glycosylation is abolished by 
site-directed mutagenesis of the site of attachment in the recognition sequon 
(Asn-X-Ser/Thr) through a conserved transition of the Asn to a Gin residue. A 
truncated form of tACE (tACEA36 lacking the first N-terminal 36 residues is used for 
the construction of all mutants. tACEA36 cDNA is divided into four fragments and 
introduced into pGEM-llZf(+) to facilitate site-directed mutagenesis. An EcoATDl 
site is introduced into pGEM-HZf(+) for cloning of the first and second tACE A3 6 
fragments. A 146 bp BamlH/Nhel digested pBR329 fragment that contains an 
Eco47m site is ligated to BaniHUXbal digested pGEM. The following mutagenic 
oligonucleotides are used for altering the consensus sites for iV-linked glycosylation: 
gl-F, g2-F, g3-F, g4-F, g5-F, g6-F, gl-R, g2-R and g3-R, with the number referring to 
each complementary recognition sequon. The mutagenic oligonucleotides encode the 



• # 

Asn to Gin transition (underlined) at the glycosylation sites and a silent mutation that 
alters a restriction enzyme recognition site (highlighted in italics). Mutated residues 
are indicated by bold type: 



Forward primers: 

gl-F (40-mer): 5'-GAGGCCAA 7TGGAACTACAACACCCAGATCACCACAG AG-3 ' 
g2-F (36-mer): 5'-ATGCAAATAGCCCAGCACACCCrr/L4GTACGGCACC-3' 
g3-F (40-mer): 5'-GAAGTTTGATG7T/L4 CC AGTTGCAGCAGACCACTATCAAG-3 ' 
g4-F (30-mer): S'-GTOTGCCACCCGCAAGGraGCTGCCTGCAG-S' 
g5-F (36-mer): 5'-CCGTGCCTCCTG^7TCTGGCAjGAAGTCGATGCTGG-3' 
g6-F (36-mer): 5'-ACGGGCCAGCCCCAGATG^GCGCITCGGCC-3' 

Reverse primers are complementary to the forward primers: 

gl-R (40-mer): 5'-CTCTGTGGTGATCTGGGTGTTGTAGTTCCA ATTGGCCTCG-3' 
g2-R (36-mer): 5'-GGTGCCGTACTTAAGGGTGTGCTCGGCTATTTGCAT-3' 
g3-R (40-mer): S'-CTTGATAGTGGTCTGCTGCAACTGGTTAACATCAAACTTC-S 1 

In the first fragment a cDNA-containing mutation at one site served as a template for 
the mutagenesis of the second site thus generating five variants with either one or two 
sites out of the first three eliminated. The fourth, fifth and sixth sites (N155, N337 and 
N586) are eliminated from fragments 2, 3 and 4 respectively using primers g4-F, 
g5-F, g6-F. The nucleotide sequence of each fragment is confirmed by sequencing to 
ensure that only the desired mutation has been created. The four fragments are 
reassembled to produce eight tACE A3 6 glycosylation mutants (Fig 1). 

The mutants are introduced into the mammalian expression vector, pLEN, to facilitate 
the production of underglycosylated tACEA36 protein. The expression vectors pLEN- 
tACEA36g(n), where n is a set of numbers defining the available ^-linked 
glycosylation sites, are stably expressed in CHO cells as described previously (Ehlers 
et al. (1996) Biochemistry 35, 9549-9559). Similarly, wild-type tACE (tACE-wt), 
retaining its 36-residue N-terminus as well as the transmembrane region and 
juxtamembrane stalk, is stably expressed in CHO cells. A deletion mutant, 
tACE A3 6N J, truncated after Ser625, and thus lacking the cytoplasmic and TM 
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domains as well as most of the juxtamembrane stalk, is" constructed (Yu et al. (1997) 
J. Biol Chem. 272, 3511-3519). The vector pEE14-ACEA36NJ is transfected into 



CHO cells and clones stably expressing the mutant are amplified using methionine 
sulfoxamine, as described (Yu et al (1997) J, Biol Chem, 272, 3511-3519). 



Inhibition of tACE glycosylation in presence of tunicamycin 

CHO cells expressing tACE-wt are grown to confluence in 6-well plates and induced 
overnight in 2 % medium in the absence and presence of 5 ^tg/ml tunicamycin. Fresh 
medium is added and cells axe grown in the absence and presence of 5 (xg/ml 
10 tunicamycin. At 6, 8, 12, 16, 20 and 24 hours detergent-solubilised cell samples are 
collected. After separating the proteins on 10 % SDS-PAGE they are immunodetected 
by Western blotting. 

Western Blot Analysis of Glycosylation Mutants 

15 tACE-wt, tACEA36, tACEA36-gl, -g2, -g3, -gl2, -gl3, -g23, -gl23 and -gl234 are 
immunodetected by Western blotting of cell lysates and harvested medium from 
transfected CHO cells. Proteins are separated on 10 % SDS-PAGE and transferred to 
nitrocellulose membrane (Hybond-C, Amersham). The membrane is probed with a 
polyclonal rabbit anti-human tACE antibody. The membrane is developed using the 

20 ECL chemiluminescence kit (Amersham) and visualised on autoradiographic film, as 
per instructions. 

Analysis of Release Kinetics and Cleavage Sites 

After selection for stable transfectants, kinetic analyses of rates of accumulation of 
25 soluble (released) ACE activity and changes in membrane-bound ACE activity are 
performed in the presence and absence of 1 pM phorbol 12,13 dibutyrate (Schwager 
et al (1998) Biochemistry 37, 15449-15456). Identification of the stalk cleavage sites 
and analysis of the C-terminal glycosylation site in the released (soluble) protein is 
carried out using limited proteolysis and MALDI-TOF mass spectrometry by methods 
30 described previously (Schwager et al (1998) Biochemistry *37, 15449-15456). 



5 



Determination of kinetic constants for tACE hydrolysis of Hippuryl-L-histidyl-L- 
leucine (Hip-His-Leu) 
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The enzyme is purified from harvest medium via lisinopril affinity chromatography 
and rates of substrate hydrolysis are deteimined. Assays are performed in 100 mM 
potassium phosphate pH 8.3 containing 300 mM NaCl. Initial velocities are 
calculated over a range of Hip-His-Leu concentrations (0.2-5.0 mM) under initial rate 
conditions and fit to the Michaelis-Menten equation. K m and V max values are 
determined by non-linear regression analysis. Turnover numbers (k^) and specificity 
constants (W^m) are determined using a calculated molecular mass of 100 kDa. 

Mass Spectrometry 

Mass spectrometry is used to verify the glycan occupancy of some of the 
glycosylation sites of the mutant proteins. All mass spectra are obtained on a 
MALDI/TOF/MS instrument (Voyager-Elite Biospectronietry Workstation, 
PerSeptive Biosystems, Inc.). A nitrogen laser (337 nm) is used for desorption 
ionization. Measurements are carried out either in the linear or reflectron mode with 
mass accuracies of 0.1% and 0.01%, respectively. Spectra are collected over a 
hundred laser shots. 

Typical matrices used in these experiments are 3,5-dimemoxy-4-hydroxycinnamic 
acid (sinapinic acid) and a-cyano-4-hydroxycinnamic acid (Aldrich). About 1 (xl of 
sample solution was mixed with 2 ul of the matrix solution (10 mg/ml in 50% v/v 
CH 3 CN and H 2 0). A 0.5 ul volume (containing 1-10 pmol of peptide or peptide 
mixture) of the above solution is loaded on the sample plate and allowed to dry. 



Oystallisation 

The purified tACE- ACEA36NJ and ACE glycosylation mutants are stored at -20 °C 
in 10 mM HEPES and 0.1% PMSF. Extensive crystallisation trials using 
commercially available crystal screen conditions (Hampton Research) are tried. In 
addition ammonium sulphate, PEG and MPD matrices are also tried. The best crystals 
for these proteins are grown at 16 °C by the vapour diffusion hanging drop method by 
mixing 2 ul of the protein solution at -11.5 mg/ml in 10 mM HEPES and 0.1% PMSF 
with an equal volume of a reservoir solution containing 15 % PEG 4000 (Fluka), 50 
mM CH 3 COONa.3H 2 0 (Sigma Chemical Company) pH 4.7 and 10 uM ZnS0 4 .7H 2 0 
(Aldrich Chemical Company). Crystals usually appear within 2 weeks and grow to 



their maximum size after about a~month. The crystals belong to P2i2x2i space group, 
with 1 molecule in the cry stallo graphic asymmetric unit and some 49 % of the crystal 
volume occupied by the solvent (cell dimensions: a= 56.47 A, b=84.90 A, c=133.99 
A, oc=90°, p=90° and y=90°). 1ACE-ACEA36NJ crystals diffract to 2 A while tACE- 
ACEA36NJ-gl3 and tACE- ACEA36NJ-gl234 crystals diffract to 3.0 A and 2.8 A 
respectively on a Synchrotron Radiation Source. All crystals are isomorphous. 

The tACE-lisinopril (lisinopril.dihydrate) inhibitor complex is obtained by growing 
the crystals in the presence of inhibitor. In these experiments the protein solution is 
mixed with 1 0 mM of the inhibitor and mixed with an equal volume of the reservoir 
solution before setting up the crystallisation. Single crystals belonging to the P2i2i2i 
orthorhombic space group (isomorphous with the native crystals) suitable for 
diffraction work appear after about 4 weeks. 

X-ray diffi-action data collection 

Before data collection, all crystals are flash-cooled at 100 K in a cryoprotectant 
containing 15 % PEG 4000, 50 mM CH 3 COONa.3H 2 0 pH 4.7, 10 \xM ZnS0 4 .7H 2 0 
and 25% glycerol with and without respective entities - such as ACE inhibitors. All 
the X-ray data are collected at 100 °K using a Synchrotron Radiation Source (ESRF- 
Grenoble, France). Multi-wavelength anomalous dispersion (MAD) data are 
collected with crystals of tACE-lisinopril complex at peak of Zn K-edge (1.2825 A), 
inflection (1.28322 A) and remote (0.95373 A) wavelengths. An anomalous dataset at 
long wavelength (1.7712 A), the closest possible wavelength to Sulphur K-edge is 
also collected. All other heavy atom data sets are collected at the SRS-Daresbury 
(UK) and processed with anomalous signal. Out of more than 32 soaks with different 
heavy atoms, only three are found to be useful in phasing. These three (Pt, Pd & Os) 
derivatives are prepared by soaking the tACE-lisinopril complex crystals for - 10 to 
60 minutes in the presence of 1-5 mM heavy atom solutions. Raw data images are 
indexed and scaled using the DENZO and SCALEPACK modules of the HKL suite 
[Otwinowski M and Minor W. (1997) Methods Enzymol 276, 307-326]. 
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Expression of glycosylate mutants in CHO cells and the kinetics of release 

The role of //-linked glycosylation in the expression and processing of tACEA36 is 
investigated to establish the minimal glycosylation requirements for the expression of 
correctly folded enzymatically-active protein. The eight glycosylation mutants 
constructed are transfected into CHO cells and the effect of deglycosylation on the 
activity, expression and processing of tACEA36 is assessed by enzyme assays, 
immunodetection and cleavage site analysis. 

Western blot analysis of cellular and soluble 1ACEA36 glycoforms reveal proteins 
with increased mobility relative to glycosylated wild type tACE (tACE-wt) (Fig 2) 
and which is dependent upon on their degree of glycosylation. Furthermore, 
differences are detected in the expression and processing of the various glycofonns. 
Comparatively low levels of tACEA36-g2 protein in cellular extracts (Fig. 2A), and 
15 its absence in the harvest medium (Fig. 2B), may suggest intracellular degradation 
and/or deficient processing of the protein. Whilst tACEA36-gl demonstrates 
comparable levels of expression to the other proteins (Fig. 2A), processing and 
solubilisation appears to be less efficient than tACEA36-g3, -gl23, and -gl234 (Fig. 
2B). Tunicamycin partially inhibits the formation of mature, glycosylated tACE-wt. 
20 Western blot analysis of cellular tACE-wt in the absence of tunicamycin reveals a 
105 kDa mature, glycosylated protein (Fig 3). In the presence of tunicamycin an 
additional smaller, deglycosylated 85 kDa protein is detected (Fig 3). 

The K m and values obtained for the hydrolysis of Hip-His-Leu by tACE (Table 3) 
25 are in agreement with those previously published for the C-fragment of the human 
endothelial isoform (K m = 2.0 mM) (10). Differences in the kinetic constants of 
glycosylated and under-glycosylated 1ACEA36 mutants are negligible and are not 
considered sufficiently different to reflect major alterations in the conformation and 
activity of the protein. Glycosylation of tACE A3 6 at one (tACEA36-g3) or two 
30 (tACEA36-gl3) N-terminal glycosylation sequons is sufficient in mamtaining the 
functional integrity of the enzyme. Furthermore, treatment of cells expressing 
tACEA36NJ with the glucosidase I inhibitor NB-DNJ, do not alter the kinetic 
properties of the expressed enzymes. 



Example 3 

Kinetics of release and analysis of juxtamembrane cleavage sites 

Independent transfections of CHO cells with tACEA36-gl, -gl2, -gl23 and -gl234 
are performed which yield consistent results for each mutant (Fig. 4). All 
glycosylation mutants show similar levels of phorbol ester stimulation. Thus, N- 
linked glycosylation does not appear to affect solubilisation of the membrane- 
anchored enzyme. Further protein analysis is performed to: 1) identify the C-terminaJ 
peptide to determine whether cleavage of tACEA36-gl, -gl3 and -gl234 occurrs at 
the same residues as tACE and 2) to investigate the glycosylation status of the seventh 
recognition sequon which lies seven residues proximal to the tACE cleavage site. The 
mass spectra of endoproteinase Lys-C digested peptides of tACEA36-gl, -gl3 and 
-gl234 reveal three |M+H] + ions of m/z 1698.7, 1690.5 and 1690.9, respectively 
(Table 2), which are in close agreement with the theoretical mass of the C-terminal 
peptide (calculated m/z 1 690. 8). 

Thus, tACEA36-gl, -gl3 and -gl234 proteolysis occurs between Arg627 and Ser628, 
which is at the same site as tACE-wt. The spectra of the mutant tACEA36NJ reveals a 
[M+H] + ion at m/z 1463.1 which corresponds to the calculated mass of the peptide 
Leu614 to Ser625 confirmi n g that truncation occurrs at Ser625 and that there is no 
further limited proteolysis of the C-terminus (Table 2). 

Example 4 

Structure of ACE 

The crystal structure of the native ACE enzyme is determined to 2.0 A resolution. 
The structure from residues 71-648 adopts an overall ellipsoid shape (dimensions 
~72A x 57A x 48 A) with a central groove that extends for about 30A into the 
molecule and effectively divides the protein into two "domains 55 (labelled I and II, 
Fig. 7a,b). On top of the molecule there is a N-terminal "flap 55 that allows access to 
the active site cleft. Like the recently determined zinc-peptidase structure-neurolysin, 



'A 



ACE is also bristling with helices and is comprised of twenty a-helices (with almost 
equal distribution of helices in both domains) and seven 3io-helices (Fig. 7c). This 
makes the molecular architecture very compact and rigid as observed from the 
thermal parameters (crystallographic B-factors) for the structure. The only p-structure 
which accounts for 4% of all residues occurs as six relatively short strands, two of 
which are located near the active site (Fig. 7a). Hie boundaries of the groove are 
provided by domain II (cd3, al4, al5, al7, P4) with part of the domain I forming 
the flap (al, a2, a3). In addition the base of the scaffold is formed by helices oc6 and 
a8 and helix al5 serves as an anchor. 

Eight C-terminal residues are disordered in the crystal structure and most of this 
flexible region constitutes the unconstrained juxtamembrane region distal to the wild- 
type ACE cleavage site. The a-helix from Ala 620 to His 641(a20) defines the C- 
terminal boundary of the ectodomain and this is in agreement with previous ACE 
mutagenesis and cleavage-secretion studies (Ehlers et al, 1996; Chubb et al, 2002). 
Residues Ser 466-Gly 469 are disordered hi the structure. Five hundred and four 
ordered water molecules were identified in the native structure. All six glycosylation 
sites (gl-6, Fig. 7c) are exposed and gl site is closest to the N-terminus of the 
molecule. Sketchy density was observed for N-linked carbohydrates in all the 
glycosylation sites (Fig 7c) and modelled as N-acetylglucosamine (NAG) in the 
native structure. The co-ordinates of underglycosylated tACEA36NJ ACE are shown 
in Table A. 



Example 5 

Zinc co-ordination 

ACE belongs to the gluzincin family of metalloproteases (Hooper, 1994). Zinc is 
known to be an important catalytic component of ACE. In the structure, the active site 
helices al3 and al4 define the substrate binding cleft of this peptidase. One highly 
ordered zinc ion (B-factor=16.34A 2 ) is bound at the active site. The helix ccl3 
contains the HEXXH zinc binding motif (known to be important for the catalytic 
activity of ACE, Wei et al, 1991), with its two zinc coordinating histidines (His 414 




and His 418) and Glu 442 (from ocl4 helix) acts as the third ligand (Fig 7c). 
Additional coordination is provided by an acetate ion (from the crystallization 
medium) bound at the active site of the native enzyme (Fig. 8a). It has been reported 
previously that the recruitment of zinc ion in ACE catalysis is analogous to that in 
5 thermolysin (WilUams et at, 1994; Corvol et al t 1995) and our structural data shows 
that the zinc binding sites in both proteins are indeed very similar (r.m.s. deviation 
1.52A) except the acetate ion which is replaced by a water molecule in the 
coordination sphere in thermolysin. The active site pocket in ACE is occupied by 3 
ordered water molecules. In addition, a stretch of clear electron density (at 3a level) 
10 was observed (-5 A away from the zinc ion) at the active site which is interpreted as 
part of butyl succinate (Fig 8a). 

Example 6 

Activation of ACE by chloride ions 

15 

Substrate hydrolysis by ACE is activated by chloride anions and, whereas some 
substrates can be cleaved in the absence of chloride, it is an absolute requirement for 
angiotensin I (Bunning and Riordan, 1983; Shapiro et al, 1983). Our structure 
revealed the location of two chloride ions (Fig. 7a, c). The first chloride ion (Cll, 

20 20.7A away from the zinc ion) is bound to three ligands, Arg 520 (NH1), Arg 217 
(NE), Trp 516 (NE1) and water and is surrounded by a hydrophobic shell of four 
tryptophans. The second anion (C12, 10.4A away from the zinc ion) is bound to Arg 
553 (NE) in agreement with previous report showing Arg 1098 (the analogous Arg 
residue in the C-domain of somatic ACE) is critical for chloride dependence of ACE 

25 activity (Liu et al, 2001). Tyr 255 and a water molecule are the other two C12 ligands. 
The two chloride ions (separated by 20. 3 A) in ACE are bound almost in a triangular 
conformation —in accordance with the near equatorial coordination and 
accommodation of planar trigonal anions in oc-amylase where the chloride ions are 
involved in the allosteric activation (Aghajari et al, 2002). In addition, the anchoring 

30 of Domain II on Domain I scaffold appears to be mediated by the chloride ions. 

Mutating Arg 1098 to a lysine resulted in a 100-fold reduction in chloride binding 
affinity as well as a decrease in substrate affinity (Liu et al, 2001). This might be due 




to the contact with the NE atom of arginine as opposed to an NH1 or NH2 atom with 
lysine which would permit free rotation of the basic terminal nitrogens. Recent studies 
suggest that the affinity of chloride binding to ACE is directly related to the 
interactions between the chloride and enzyme as well as the substrate (Liu et al, 
2001). However, based on the location of both chloride ions in the 3D structure make 
the latter interaction unlikely. Nevertheless, there is a van der Waals contact between 
C12 and Pro 438 (al4) that could be responsible for maintaining the active site cleft in 
a conformation which favours substrate accessibility and binding. Indeed ccl4 contain 
the third zinc co-ordinating ligand Glu 442. Thus, the chloride interactions likely 
cause the enzyme to adopt conformations that permit higher affinity substrate binding 
and that it is not important in substrate binding per se. 



Example 7 

Structure determination and refinement of an ACE-lisinopril complex 

The crystal structure of tACE bound to the potent lisinopril inhibitor (K;=27 x 10 10 M, 
Wei et al, 1991) complex is determined by a combination of MAD and MORAS 
(Multiple Isomorphous Replacement with Anomalous Scattering) procedures. The 
position of zinc atom is unambiguously identified using the anomalous difference 
Patterson maps calculated Using diffractions data at peak wavelength. The MAD 
phases thus obtained are not very strong and thus additional phase information is 
obtained using MIRAS procedures with three- Platinum (KiPtCU), Palladium 
(K 2 PdCU) and Osmium (OsCl 3 ) heavy atom derivatives. The identified Zn site is 
used to obtain the starting phases in each derivative. Double difference Fourier maps 
calculated using FFT routine in CCP4 program [Collaborative computational project 
Number 4. The CCP4 Suite: Programs for Protein Crystallography (1994) Acta 
Crystallogr. D 50, 760-763] gave the first major binding site and the phases from the 
combined Zn and first major site are used to get additional major/minor sites for each 
derivative. All heavy atom binding sites and the Zn site are refined to 2.8 A 
resolution using the program MLPHARE [Collaborative computational project 
Number 4. The CCP4 Suite: Programs for Protein Crystallography (1994) Acta 
Crystallogr. D 50, 760-763] and SHARP [De La Fortelle, E. & Bricogne, G. 



Maximum-likelihood heavy-atom parameters refinement in the" MIR and MAD 
methods (1997) Methods Enzymol. 276, 472-494]. The overall figure of merit from 
SHARP is 0.38 (at 2.8 A resolution) and is improved to 0.89 (at 2.0 A) by iterative 
solvent flattening, phase combination and phase extension with the program 
SOLOMON [Abrahams, J. P. & Leslie, A. G. W. Methods used in structure 
determination of bovine mitochondrial Fl ATPase. (1996) Acta Crystallogr. D 52, 
1 10-1 19]. Model building is carried out manually using the program O [Jones, T. A., 
Zou, J. Y., Cowan, S. W. & Kjeldgaard, M. Improved methods for building protein 
models in electron density maps and the location of errors in these models (1991). 
Acta Crystallogr. A 47, 110-119]. Refinement of the model is carried out using the 
program CNS [Briinger, A. T. et al. Crystallography & NMR System: A new software 
suite for macromolecular structure determination. (1998) Acta Crystallogr. D 54, 905- 
921]. During the final stages of refinement water molecules, zinc ion and the 
inhibitor molecule are inserted in the respective structure. Both the native and 
inhibitor complex structures are refined at 2.0A resolution with -94 % residues in the 
maximum allowed region and no residue in the disallowed region of the 
Ramachandran map. The final structure of the tACE and ACE- lisinopril complex has 
an R fr6e [Brunger, A. T. Free R value: a novel statistical quantity for assessing the 
accuracy of crystal structure. (1992) Nature 355, 472-475] of 22.08 and 21.88 %, and 
a final R^t of 18.29 and 18.14 % respectively. 

i 

Highly ordered binding of the inhibitor (overall B-factor 15.26A 2 ) in an extended 
conformation with the aromatic phenyl group stretching towards the flap (Fig 7a) and 
its lysine side chain parallel to the HEXXH motif (Table 1). Upon complex formation 
no significant rearrangement of active site residues was observed. The Pi carboxylate 
of lisinopril which is substituted for the normal substrate scissile amide carbonyl is 
well positioned to bind to the active site zinc (located between subsites Si and Si'). 
Thus lisinopril provides a coordinating atom (provided by an acetate ion in the native 
structure, Fig 8a), while the other three (two histidines and a glutamic acid) being the 
same as in the native structure (Fig. 8b). The phenyl ethyl group binds to the Si 
subsite and the lysyl amine to Glu 193 via van der Waals interactions at the Si' subsite 
of ACE. Surprisingly, the C-terminal carboxylate which is thought to interact with a 
positively charged arginine residue instead binds to a lysine (Lys 542) as well as Tyr 
551. The high affinity is attributable to the presence of the S-configuration aromatic, 
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aralkyl or aliphatic groups able to bind to the Si site (Brenner et aL, 1990). The 
contact residues for the lisinopril complex are shown in Table 1 . The co-ordinates of 
the underglycosylated 1ACEA36NJ ACE-lisinopril complex are show in Table B. 

5 Example 8 

The homology of ACE with other metallopeptidases 

ACE belongs to the M2 family of zinc binding metallopeptidases which falls under 
10 the umbrella of the MA clan. Outside the HEXXH metal coordinating signature 
sequence of this family there is little sequence similarity between ACE and other 
members of the family. Surprisingly, peptidases such as thermolysin (MA clan, M4 
family) and carboxypeptidase A (MC clan, M14 family) which have been used in 
comparative molecular field analysis and 3D quantitative structure-activity 
15 relationship studies (Waller & Marshall, 1993) showed no structural homology with 
ACE. 

Surprisingly, structural comparison of ACE with other protein structures using the 
DALI server (Holm and Sander, 1999) showed significant structural homology with 

20 that of neurolysin (Brown et al s 2001), a protein involved in neurotensin metabolism 
(Fig: 3 a, b). Neurolysin is a member of the M3 family of thirnet oligopeptidascs and 
like ACE, belongs to the family of metallopeptidases bearing the HEXXH active site 
motif (Rawlings and Barrett, 1995; Brown et al, 2001). It also comprises of an 
abundance of a-helices with the P-structure accounting for only 6% of all residues. 

25 The two proteins do not exhibit any amino acid sequence similarity (close to random 
score), yet when the two structures are optimally superimposed (using DALI server), 
there is noticeable match with an r.m.s. deviation of 4.0A for 143 C a atoms. It appears 
that the core structure for the two proteins are highly similar with significant 
differences in loops on the outer surface in the case of neurolysin (Fig. 3a,b). The 

30 striking similarity also extends to the active site region in neurolysin consisting of a 
deep narrow channel that divides the molecule into two halves. It has been speculated 
that using the flexible secondary structure elements in the active site cavity, the 
neuropeptidase can effectively cleave a variety of small peptides. Likewise in ACE, 
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the geometry of the active site groove clearly accounts for ACE's inability to 
hydrolyse large, folded substrates. Furthermore, the enzyme's preference for 
oligopeptide substrates of about thirteen residues or less suggests that the substrate 
does not have the same freedom to extend outside of the channel during catalysis. 

5 

Example 9 

Substrate specificity 

10 The geometry of the active site channel clearly accounts for ACE's inability to 
hydrolyse large, folded substrates. " Furthermore, the enzyme's preference for 
oligopeptide substrates of about thirteen residues or less suggests that the substrate 
does not have the same freedom to extend outside of the channel during catalysis. 

^ 15 Example 10 



Crystallization of ACE mutants 



tACEA36-gl234 and tACEA36-gl,3 mutants are successfully crystallised under 
20 similar conditions as tACEA36NJ protein (Fig 5). Preliminary diffraction experiments 
.show that tACEA36-gl234 crystals diffract to at least 2.8 A and tACEA36-gl,3 
crystals diffract to 3.0 A respectively and these crystals are isomorphous with 
tACEA36 crystals. 

25 CONCLUSIONS 

The niinimal glycosylation requirements for the expression and processing of 
enzymatically-active human tACE are determined by progressive deglycosylation. To 
investigate which sugars are vital for tACE expression, a series of mutants are 
30 constructed that contain a limited number of sites for iV-linked glycosylation. N- ■ 
glycan recognition sequons within the protein sequence are disrupted by mutagenesis 
of the Asn to a Gin within the recognition sequon. 
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The removal of a large percentage of glycans does not hamper the expression of 
enzymatically-active 1ACEA36 and the presence of glycans at two of the three N- 
terminal sites is sufficient to produce protein that retain the kinetic properties of the 
native enzyme. Furthermore, the minimum N-lioked glycosylation that allowed for 
5 enzymatically-active tACE to be expressed is the presence of glycans at the first or 
third site. Glycosylation at either of these sites appears to be sufficient for protein 
folding, whereas glycosylation of the second site only, results in rapid intracellular 
degradation. This is in agreement with the glycosylation requirements of active rabbit 
ACE expressed in mammalian cells and yeast which also has a minimal requirement 
10 of at least one of the three N-tenninal sites (3 a). 

In the construction of the tACE A3 6 glycosylation mutants, the seventh site which is 
not present in rabbit tACE is not targeted as it has been shown to be unglycosylated 
(7a). With the removal of the glycosylated sequons, it is possible that this site may be 
15 glycosylated in order to compensate for the loss of oligosaccharide chains. However, 
this is not the case and that even when there is only one A^-linked oligosccharide, this 
does not occur. 

There is a preference for oligosaccharides to be attached to sequons containing 
20 threonine residues rather than serine residues. This is observed in human tACE, 
where all sequons with a serine in the third position are partially glycosylated and 
those with threonine are fully glycosylated, except for the seventh site, which has a 
threonine in the third position (7a). 

25 The fact that glycosylation at the seventh site appears to be occluded by the presence 
of the tryptophan, rather than the structure of the juxtamembrane region, is supported 
by the findings that a tACE deletion mutant, tACE-A6, which generates a novel Asn- 
Arg-Ser sequon adjacent to the wild-type cleavage site, is glycosylated (14a). The 
extended conformation of oligosaccharides may prevent access of the secretase at this 

30 point resulting in displacement of the wild-type cleavage site 13 residues towards the 
C-terminus (14a). 
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CHO cells expressing wild-type tACE (tACE-"wt) are exposed to tunicamycin, which 
inhibits N-linked glycosylation, to determine the effect of complete deglycosylation 
on the expression and activity of tACE over time. Cellular tACE-wt activity in cells 
treated with tunicamycin is approximately three-fold lower than that of untreated cells 
(data not shown) and a smaller immature form of the protein is observed by Western 
blot analysis. These results are consistent with the interpretation mat the smaller 85 
kDa-protein is the deglycosylated form and the 105 kDa-protein, the mature 
glycosylated form (7a), (9a), (14a). Thus, -tunicamycin inhibits the formation of 
mature glycosylated tACE-wt, to a certain degree, resulting in the accumulation of 
unglycosylated tACE-wt in the cell, without a significant decrease in the total amount 
of protein compared to untreated cells. Our data suggest that unglycosylated tACE-wt 
is not solubilised and that it requires the presence 6f N-glycans for proper expression, 
maturation and processing. 

The enzymatically-active tACE-wt present in cells treated with tunicamycin is most 
likely the product of a pool of glycosylated tACE-wt, which is either produced prior 
to tunicamycin treatment or is resistant to tunicamycin. Tunicamycin may therefore 
not be completely effective in CHO cells, whereas in HeLa cells, Kasturi et al. found 
that tunicamycin-treatment resulted in the generation of deglycosylated protein that is 
degraded intracellularly within three-hours, with no soluble tACE being detected (5a). / 
Thus, it appears that the effect of tunicamycin on tACE glycosylation appears to be 
cell-specific. 

Enzymatic removal of ^-linked glycans is used to produce crystals of CD2 
glycoprotein that diffracted to 2.5 A (15a). However, crystallisation trials with 
deglycosylated tACEA36NJ that has a single GlcNAc residue attached to the 
asparagine residue of the ^-linked site only produced needle-like crystals that are not 
suitable for further diffraction studies. Surprisingly, tACEA36NJ expressed in the 
presence of NB-DNJ, but retaining the simple high mannose oligosaccharides 
(Glc 3 Man 7 GlcNAc 2 ) yielded the best crystals for X-ray diffraction studies and 
subsequent successful 3D structure determination at 2.0A resolution. 
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Interestingly, the presence of the membrane anchor increased the extent of human 
CD 5 9 glycan processing (16a). Possible reasons for this difference are, that the 
ectodomain is held closer to the glycan-processing enzymes and/or the longer 
exposure to the glycosyltransferases. The oligosaccharide processing of soluble 
tACEA36NJ protein may also differ from the membrane-bound form however, 
expressing the protein in the presence of a glucosidase inhibitor would prevent the 
production of complex oligosaccharides. 

Furthermore, the kinetic properties of this mutant are not influenced by NB-DNJ 
treatment. Similarly, Yu at aL reported identical K m values for glycosylated and 
deglycosylated forms of tACEA36NJ, in agreement with that of tACE-wt, using 
furanacryloyl-Phe-Gly-Gly as a substrate (7a). Glucosidase inhibitors, at 
concentrations required to block trimming of the terminal glucose residues and 
subsequent complex oligosaccharide formation, can affect cell maturation and 
apoptosis (Misago M, Tsukada J, Fukuda MN, Eto S (2000) Biochem Biophys Res 
Commun. 269, 219-225). This treatment also results in a decrease in protein 
expression (unpublished data). Thus, mutants lacking some of the C-terminal N- 
linked glycans are Used with the objective of producing protein with a minimal 
number of oligosaccharides that would crystallise in a reproducible fashion. 

Crystals of glycosylation mutants gl3 and gl234 have been grown under similar 
condition as in the case of tACEA36NJ. The glycosylation mutant crystals seem to 
grow faster than tACEA36NJ crystals. However, they are smaller in size in the first 
instance, but have been proven to be suitable for diffraction work. 

The determination of the 3D structure of ACE and ACE-lisinopril complex may be 
effectively used for the development of novel, highly selective ACE modulators 
targeted, for example, to either the N or the C domain, by structure-based rational 
drug design. This may produce a new generation of ACE inhibitors with the potential 
for greater efficacy, fewer side effects and treatment of new indications (e.g. 
polycythemia). In addition, the unanticipated similarity with neurolysin has shown 
the structural conservation amongst an emerging family of peptidases with a common 
evolutionary origin. 
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In a further aspect, the present invention relates to a composition comprising ACE in 
a crystalline form. 



In a further aspect, the present invention relates to a scalable 3D model of ACE 
having at least a portion of the structure co-ordinates shown in Table A or Table B. 

All publications mentioned in the above specification are herein incorporated by 
reference. Various modifications and variations of the described methods and system 
of the invention will be apparent to those skilled in the art without departing from the 
scope and spirit of the invention. Although the invention has been described in 
connection with specific preferred embodiments, it should be understood that the 
invention as claimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the described modes for carrying out the invention 
which are obvious to those skilled in molecular biology or related fields are intended 
to be within the scope of the following claims. 
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Table 1 



-Lisinopril Contact Residues (hydrogen bonds at the active site) 
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2.70 


Ol 


His 544 NE2 
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Zinc Coordination-ACE-Lisinopril Complex 



Coordination Distance (A) 

Zn His 414 NE2 2.04 

Zn His 418 NE2 2.07 

2n Glu 442 OE1 2.00 

Lis 03 2.14 



Lis = Lisinopril 




Table 2 



Data Collection 



Crystal 


Wavelength(A) 


Resolution range 3 
(A) 


% 

completeness 


<!>/< a> a 


r> ab 
Emerge 

(%) 


MAD Phasing 


Zn Peak 1.2825 


50-2.00(2.07-2.00) 


94.7 (63.2) 


11.54(1.93) 


5.4(22.3) 
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15.48(2.63) 


4.8(14.9) 


Long Wavelength 
1.7712 


50-2.66(2.76-2.66) 
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8.3(21.8) 


MIRAS 
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90.2(83.6) 


9.94(4.52) 


11.2(27.1) 


Pt25 
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13.6(4.17) 


8.8(27.8) 


Pd 
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50-2.18(2.26-2.18) 


89.4(81.2) 


26.14(6.73) 


5.8(25.8) 


Pd2 
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30-2.60(2.69-2.6) 


95.2(86.7) 


16.95(5.46) 


9.5(35.7) 


Oscl 


1.488 


50-2.80(2.9-2.8) 


96.6(91.4) 


18.51(6.16) 


13.1(40.4) 


Native Data 


1.488 


50-2.00(2.07-2.0) 


98.8(95.7) 


22.46(4.44) 
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Model 


Resolution 
(A) 


Reflections 
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Rciyst 
(%) 


Rfree d ( 
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B factor Rmsd (A 2 ) 












Bond 

length 

(A) 


Bond 
angle 
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Bonded 
Main Chain 
Atoms 


Bonded 
Side 
Chain 
Atoms 


Native 
(ACE) 


47.14-2.0 


39727/ 1675 


18.29 


22.08 


0.005 


1.22 


1.281 


2.126 


Lisinopril 

complex(A 

CE-LIS) 


47.14-2.0 


76164/3094 


18.14 


21.88 


0.0055 


1.24 


1.174 


2.048 



a Values in parentheses are for the outer resolution shell. 

b Rmcrge 888 2 1 1 - <I> | / £ I, where I is the observed intensity and <I> is the average intensity obtained 
from multiple observations of symmetry related reflections. 
10 '"Rcrys^SIFol-IFcl/ZjFol where Fo and Fc are observed and calculated structure factors respectively. 
d Rfo; C is obtained for a test set of reflections, consisting of a randomly selected ~ 4 % of reflections. 
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Table 3 " 

Kinetic parameters for the hydrolysis of ffip-His-Leu by various tACE glycoforms 
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tACEA36-gl234 


1.5 


85 


56 


tACEA36-NJ I 


2.6 


253 


99 


tACEA36-NJ 2 


4.1 
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70 



1 Expressed in CHO cells treated with NB-DNJ. 

2 Expressed in NB-DNJ-untreated CHO cells. 




Table 4 



Mass Spectral Anafysis of C-Terminal Endoproteinase Lys-C Peptides' 



Peptide 










(residue no?) 


tACEA36-gl 


tACEA36g-l 


tACEA36-gl234 


tACEA36NJ 


598-613° 
614-627 d 
6i4-625 d 


1951.6 (1951.2) 

1689.7 (1690.8) 


1950.6 (1951.2) 
1690.5(1690.8) 


1950.2 (1951.2) 
1690.9 (1690.5) 


1951.5 (1951.2) 
1463.1 (1463.5) 



a Soluble (shed) ACE proteins were purified from conditioned medium of transfected cells, digested 
with endoproteinase Lys-C and analyzed by MALDI-TOF mass spectrometry. 

15 

b Amino acid residue numbering refers to wild type tACE (Ehlers et al 1989). Shown are the masses of 
the penultimate c and ultimate d C-terminal peptides. All values are calculated for protonated 
isotopically averaged molecular masses m/z. In parentheses are expected masses. 
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Table A 

Co-ordinates of underglycosylated tACEA3 6NJ ACE 

5 

rpISJS c °ordinates from minimization refinement 
KtiMARK refinement resolution: 4 7.14 - 2 0 A 
REMARK starting r= 0.1829 free r= 0.2206 
REMARK final r = 0.1829 free r= 0.2208 
10 REMARK rmsd bonds= 0.005706 rmsd angles= 1 22440 
REMARK wa= 0.837519 " U 
REMARK target= mlf cycles= 1 steps= 20 

t£S.\ S 0 3 ;^t > U 1)2W - 56621 b = - "3.7.1 axpha- ,0. 

15 REMARK ncs= none 

REMARK B-correction resolution: 6 0-20 

REMARK initial B-factor correction applied to fobs • 

REMARK Bll- 2.826 B22= -1.896 B33= -0 30 * 

9H p^!^ B12= °- 000 B13= °- 000 B 23= 0.000 

rS K - fv Ct °f C ° rrection applied to coordinate array B: 0.207 
REMARK bulk solvent: density level- 0.355826 e/A~3, B-factor- 50 0907 

reflections wit h I Fobs | /sigma F < 0.0 rejected 
25 ReS SfiSJr? ' F ° bS ' > * rmS (Fo^) rejected 

loST% T ° tal nUmber ° f refl - in reso1 - ^ge: 44472 ( 

REMARK number of unobserved reflections (no entry or |F|M» :3070 ( 

REMARK number of reflections rejected: n , n n » , 

30 REMARK. total number of reflections used: 41402 93 1 t 

REMARK number of reflections in working set: 39727 89*3 % 

REMARK VERsioN^l . 1 133 " 791 90 ' 00 9 °-°° 9 0.00P2121 2 i 

35 ATOM 2 ? JSt X 34 - 2 ° 5 71 - 662 65 - 620 1-00 47.11 

ATOM ■ ^ X 36 - 244 7 °- 263 65 " 985 1-00 47.34 

ATOM 4 2 !Jt £ 36 - 117 69 -° 44 65 - 862 L00 47 ' 51 

MOM 5 CA St ni 35 ' 784 72 - 211 67 - 437 L00 47 - 9 * 

40 ATOM 6 £ ifS ?J 35,164 71 -° 98 66 - 663 L00 47 '50 

JtoM 7 CA 25 70 ^- 3 ° 7 7 °- 929 65 - 547 L00 46 - 68 

ATOM 8 CR S5 ™ 38 ' 419 7 °- 260 64 ' 882 1 - 00 4 6-09 

ATOM 9 2 3*. 39 • 315 71 • 298 64 -200 1.00 4 6.87 

ATOM 10 CD rrn ™ • 40 ' 37 ° ? ° ' 727 63 ' 262 L00 47.65 

45 atom J"? 72 39 • 776 70 • 111 62 -005 LOO 48 95 

ATOM 12 SSS- £ 3 2-" ? 7 °- 801 61 - 309 L00 48.20 

ATOM 13 c rrn ™ ^-° 96 68 " 939 61 ' 7 08 1.00 49.03 

ATOM 14 o S£ '-- 223 69 - 474 65 - 916 L00 45.81 

ATOM 15 2 ?S 2? 39 ' 862 68 ' 466 65 - 594 L00 45.67 

50 ATOM " CA AT^ 11 f- 184 69 - 941 67 - 161 L 00 43.82 

ATOM in rn ara 11 39 - 905 69 ' 292 68 - 249 L00 42.35 

ATOM 1Q c £i II II' 995 7 °- 229 69 - 446 L00 42.98 

ATOM 19 o iS ?J -- 21 ° 67 -" 6 68 - 646 ^ 40 ' 96 

ATOM J S 73 39 - 863 66 -"0 68.926 1.00 39.62 

55 ATOM 21 CA Hu n\ Vn^ 68 " 680 L00 40.60 

ATOM 22 CB tin n\ 66,844 69 ' 025 L00 40.79 

ATOM 23 CG tin n\ 67 " 190 69 " 157 L00 42.97 

ATOM 11 2? 55 ^ 35 • 138 67 • 389 70 • 581 1-00 48.05 

ATOM 25 2lSS ?1 f' 642 6? - 644 7 °- 649 L00 51.91 

60 ATOM 26 OE2 So ^ 11' l** 66 ' 198 7 °- 142 1 * 00 53 ■ 53 

OE2 GLO 74 33.239 68.687 71.210 1.00 53.71 
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39.899 25.226 13.458 1.00 17.50 
39.121 24.987 12.406 1.00 20.20 
38.505 25.989 11.789 1.00 17.52 
38.967 23.742 11.963 1.00 18.66 
40.592 23.615 17.793 1.00 17.62 
40.369 22.490 17.340 1.00 16.82 
41.456 23.838 18.782 1.00 16.05 
42.213 22.751 19.400 1.00 16.40 
43.247 23.312 20.369 1.00 15.81 
41.288 21.770 20.122 1.00 17.14 
41.527 20.560 20.111 1.00 15.20 
40.237 22.289 20.748 1.00 14.27 
39.285 21.428 21.449 1.00 16.86 

38.331 22.266 22.301 1.00 15.32 
38.945 22.925 23.542 1.00 17.90 
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37.292 21.621 17.062 1.00 18.41 

36.648 21.061 15.831 1.00 18.72 
35.373 21.137 15.380 1.00 17.00 
37.351 20.359 14.875 1.00 18.34 
36.537 20.031 13.887 1.00 14.90 

35.332 20.492 14.169 1.00 16.90 
38.361 19.421 17.740 1.00 19.03 
37.866 18.327 17.471 1.00 17.36 
39.666 19.659 17.700 1.00 18.55 

40.649 18.662 17.295 1.00 20.61 
42.028 19.320 17.254 1.00 23.74 
42.870 19.014 16.032 1.00 30.47 
43.922 20.102 15.857 1.00 33.76 
44.686 20.306 17.083 1.00 33.81 
45.113 21.487 17.514 1.00 35.70 
44.852 22.587 16.817 1.00 35.23 
45.800 21.569 18.646 1.00 36.87 
40.679 17.468 18.253 1.00 20.35 
40.773 16.314 17.825 1.00 19.01 
40.586 17.747 19.550 1.00 19.12 
40.641 16.692 20.559 1.00 20.16 
41.247 17.230 21.858 1.00 21.21 
41.407 16.184 22.916 1.00 23.81 
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795 40.088 34.807 
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802 59.869 33.554 

803 38.591 20.686 
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820 50.799 29.874 

821 27.864 56.624 

823 27.817 17.957 
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825 9.453 31.180 
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827 50.893 18.450 

828 26.079 14.620 

829 69.963 44.562 

830 56.486 22.481 

831 26.101 20.243 

832 16.729 33.274 

833 21.501 54.137 
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1056 58.449 30.554 

1057 25.498 11.492 

1058 44.975 29.368 

1059 25.317 25.265 

1061 28.989 59.107 

1062 62.109 32.898 

1063 68.950 47.081 
1066 53.065 54.350 
1068 44.519 16.873 
1070 45.839 26.154 

1072 36.197 36.508 

1073 35.351 41.501 

1074 65.633 34.803 

1078 41.335 38.420 

1079 29.512 38.685 

1080 14.118 37.523 

1081 48.556 54.366 

1082 40.975 25.862 

1083 40.345 57.934 

1084 54.691 59.362 

1085 53.736 57.789 

1086 15.708 33.122 

1087 22.243 32.265 

1088 47.611 17.171 

1089 58.379 48.121 

1092 57.488 59.756 

1093 37.216 31.175 

1094 37.762 33.656 

1095 65.445 43.712 

1097 28.164 21.093 

1098 18.673 33.165 

1100 51.193 61.127 

1101 56.432 43.982 

1102 56.968 41.831 

1103 34.405 49.957 

1104 41.241 9.964 

1105 21.105 21.857 

1106 23.670 20.267 

1108 29.571 9.657 

1109 27.460 9.067 
1111 24.692 47.272 
1113 29.462 58.167 

1115 " 26.304 58.757 

1116 22.934 60.561 

1117 39.244 37.381 
1119 37.428 43.614 

1121 17.907 30.515 

1122 33.636 10.725 

1123 37.687 60.192 

1124 40.750 44.964 

1125 70.584 38.077 

1126 15.603 41.927 

1127 25.206 43.505 

1128 26.442 21.223 

1130 42.766 41.142 

1131 43.109 61.103 

1132 14.570 30.169 

1133 34.157 38.583 
1135 35.943 44.155 
1137 15.347 36.178 
1142 30.780 45.800 
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Table B 



Co-ordinates of an underglycosylated tACEA36NJ ACE-lisinopril complex 



5 REMARK coordinates from minimization refinement 
REMARK refinement resolution: 4 7.14 - 2.0 A 
REMARK starting r= 0.1816 free_r= 0.2186 
REMARK final r= 0.1814 free_r- 0.2188 
REMARK rmsd bonds= 0.005621 rmsd angles= 1.23822 
10 REMARK wa= 0.450786 

REMARK target^ mlf cycles- 1 steps= 10 

REMARK sg= P2 ( 1 ) 2 ( 1 ) 2 ( 1 ) a= 56.472 b= 84.899 c= 133.990 alpha= 90 
beta= 90 gamma= 90 

REMARK anomalous f ! f t! library: CNS_XRAYLIB : anom_cu . lib 
15 REMARK reflection file- . . /dmlO_2A. cv 
REMARK ncs- none 

REMARK B~correction resolution: 6.0 - 2.0 
REMARK initial B-factor correction applied to fobs : 
REMARK Bll= 1.410 B22= 0.301 B33= -1.711 
20 REMARK B12= 0.000 B13= 0.000 B23= 0.000 

REMARK B-factor correction applied to coordinate array B: 0.605 
REMARK bulk solvent: density levels 0.336734 e/A A 3, B-factor= 42.5542 
A" 2 

REMARK reflections with | Fobs | /sigma_F < 0.0 rejected 
25 REMARK reflections with | Fobs | > 10000 * rms(Fobs) rejected 
REMARK anomalous diffraction data was input 

REMARK theoretical total number of refl. in resol. range: 84079 ( 
100.0 % ) 

REMARK number of unobserved reflections (no entry or |F|=0):821 ( 
30 5.7 % ) 
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585 


1. 


00 


31. 


36 
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385 


O 
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117 


28 . 


287 


43. 


212 


68. 


697 


1. 


00 


29. 


55 


tv mrwA 
AIUM 
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N 
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118 


29. 


768 


41. 


576 


69.106 


1. 


00 


31. 


05 


ATOM 


00/ 


CA 


GLN 


llo 


0 r"v 

30 . 


653 


42 . 


447 


69. 


868 


1. 


00 


31. 


16 


ATOM 


388 


CB 


GLN 


118 


31. 


741 


41. 


625 


70. 


563 


1. 


00 


34. 


52 


ATOM 


389 


CG 


GLN 


118 


31. 


195 


40. 


567 


71. 


509 


1. 


00 


38. 


71 


ATOM 


390 


CD 


GLN 


118 


32. 


270 


39. 


974 


72. 


402 


1. 


00 


41. 
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391 


OE1 
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118 
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393 
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GLN 
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5 
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401 


C 
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O 
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119 
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N 


ALA 
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404 


CA 
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15 


ATOM 


405 


CB 


ALA 


120 




ATOM 
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120 
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O 


ALA 


120 




ATOM 


408 


N 


ASN 


121 




ATOM 


409 


CA 


ASN 


121 


20 


ATOM 


410 


CB 


ASN 


121 




ATOM 


411 


CG 


ASN 


121 




ATOM 


412 


OD1 


ASN 


121 




ATOM 


413 


ND2 


ASN 


121 




ATOM 


414 


C 


ASN 


121 


25 


ATOM 
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ASN 


121 




ATOM 
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HIS 
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ATOM 
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CA 
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ATOM 


418 


CB 
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ATOM 


419 


CG 
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30 


ATOM 


420 


CD2 


HIS 


122 




ATOM 


421 


ND1 


HIS 


122 




ATOM 


422 


CE1 


HIS 


122 



423 NE2 HIS 122 

ATOM 424 C HIS 122 

35 ATOM 425 O HIS 122 

426 N THR 123 

427 CA THR 123 
ATOM 4 28 CB THR 123 
ATOM 42 9 OG1 THR 123 

40 ATOM 4 30 CG2 THR 123 

ATOM 431 C THR 123 

432 O THR 123 

433 N LEU 124 
ATOM 4 34 CA LEU 124 

45 ATOM , 435 CB LEU 124 

436 CG LEU 124 



ATOM 



ATOM 
ATOM 



ATOM 
ATOM 



ATOM 



ATOM 


437 


CD1 


LEU 


124 


ATOM 


438 


CD2 


LEU 


124 


ATOM 


439 


C 


LEU 


124 


ATOM 


440 


O 


LEU 


124 


ATOM 


441 


N 


LYS 


125 


ATOM 


442 


CA 


LYS 


125 


ATOM ' 


443 


CB 


LYS 


125 


ATOM 


444 


CG 


LYS 


125 


ATOM 


445 


CD 


LYS 


125 


ATOM 


446 


CE 


LYS 


125 


ATOM 


447 


NZ 


LYS 


125 


ATOM 


448 


C 


LYS 


125 


ATOM 


449 


O 


LYS 


125 


ATOM 


450 


N 


TYR 


126 


ATOM 


451 


CA 


TYR 


126 



181 



33.294 39.485 71.922 1.00 43.74 
32.039 40.013 73.711 1.00 42.41 
31 298 43.519 68.999 1-00 29.23 
31.318 44.698 69.365 1.00 28.92 
31.822 43.120 67.846 1.00 25.46 
32.457 44.081 66.957 1.00 24.61 
33.281 43.365 65.853 1.00 25.09 
32.362 42.581 64.923 1.00 24.21 
34 096 44.392 65.071 1.00 25.85 
35.175 43.779 64.208 1.00 26.22 
31 401 44.995 66.329 1.00 22.47 
31 674 46.153 66.003 1.00 22.00 
30 187 44.481 66.171 1.00 21.10 
29.109 45:283 65.599 1.00 22.02 
27.902 44.401 65.294 1.00 20.08 
28.734 46.359 66.616 1.00 22.84 
28.410 47.490 66.264 1.00 22.93 
28.790 45.987 67.888 1.00 21.65 
28.468 46.901 " 68.970 1.00 23.13 
28.572 46.166 70.300 1.00 27.59 
27 649 46.732 71.342 1.0Q 34.63 
28.016 47.635 72.096 1.00 39.08 

26.427 46.206 71.389 1.00 38.10 

29.428 48.086 68.951 1.00 21.02 
29 020 49.241 69.101 1.00 20.93 
30.710 47.789 68.776 1.00 17.91 
31.739 48.817 68.716 1.00 17.11 
33 122 48.152 68.661 1.00 15.18 
34.235 49.093 68.327 1.00 16.93 
35.035 49.844 69.119 1.00 15.16 
34.617 49.363 67.031 1.00 18.07 
35.606 50.240 67.041 1.00 18.19 
35.879 50.547 68.296 1.00 15.55 
31.520 49.698 67.487 1.00 17.81 
31.628 50.925 67.556 1.00 18.93 
31.213 49.066 66.360 1.00 17.71 
30.983 49.794 65.117 1.00 18.02 
30.700 48.823 63.956 1.00 17.98 

31.807 47.926 63.810 1.00 16.69 
30.494 49.591 62.646 1.00 18.45 

29.808 50.758 65.260 1.00 18.44 
29.872 51.903 64.817 1.00 18.36 
28 737 50.292 65.886 1.00 19.48 
27.556 51.123 66.083 1.00 21.69 
26.407 50.264 66.625 1.00 22.76 
25.009 50.870 66.792 1.00 25.84 
24.634 51.743 65.596 1.00 26.67 
24.015 49.731 66.952 1.00 28.14 
27.849 52.293 67.028 1.00 22.10 
27.395 53.414 66.805 1.00 22.48 
28.619 52.029 68.077- 1.00 21.30 
28.968 53.061 69.044 1.00 21.15 
29.826 52.468 70.165 1.00 22.88 
30.218 53.462 71.247 1-00 24.84 
31.157 52.829 72.267 1.00 27.95 
31.480 53.799 73.392 1.00 30.77 
31 992 55.102 72.875 1.00 33.91 
29.727 54.203 68.377 1.00 20.96 
29.344 55.367 68.492 1-00 20.95 
30.805 53.869 67.675 1.00 18.64 
31 598 54.893 67.020 1.00 17.79 
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ATOM 

ATOM 
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ATOM 
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ATOM 
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ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 



4 52 CB TYR 

453 CG TYR 

454 CD1 TYR 
4 55 CE1 TYR 
4 56 CD2 TYR 
4 57 CE2' TYR 

CZ TYR 
OH TYR 
C TYR 
O TYR 
N GLY 



458 
459 
4 60 
461 
462 
4 63 
464 
4 65 
466 



CA GLY 
C GLY 



4 67 CA 
4 68 CB 



O. GLY 
N THR 
THR 
THR 
4 69 OG1 THR 
CG2 THR 
C THR 
O THR 
N GLN 



470 
471 
472 
473 
474 
475 
476 
477 
478 
479 
480 
481 
482 



CA GLN 
CB GLN 
CG GLN 
CD GLN 
OE1 GLN 
NE2 GLN 
C GLN 



O 
N 



GLN 
ALA 



4 83 CA ALA 

4 84 CB ALA 

C ALA 

O ALA 

N ARG 



485 
486 
487 
488 



489 CB 
490 
491 



CA ARG 
ARG 



4 9; 



CG ARG 
CD ARG 



NE 



493 CZ 
494 
495 



ARG 
ARG 



O 
N 



496 
497 
498 
499 
500 
501 
502 
503 
504 
505 
506 
507 
508 
509 CB 



NH1 ARG 
NH2 ARG 
C ARG 



ARG 
LYS 



CA LYS 
CB LYS 



CG 
CD 
CE 
NZ 
C 
O 
N 

CA 



510 
511 



CG 



LYS 
LYS 
LYS 
LYS 
LYS 
LYS 
PHE 
PHE 
PHE 
PHE 



CD1 PHE 



126 

126 

126 

126 

126 

126 

126 

126 

126 

126 

127 

127 

127 

127 

128 

128 

123 

128 

128 

128 

128 

129 

129 

129 

129 

129 

129 

129 

129 

129 

130 

130 

130 

130 

130 

131 

131 

131 

131 

131 

131 

131 

131 

131 

131 

131 

132 

132 

132 

132 

132 

132 

132 

132 

132 

133 

133 

133 

133 

133 



33.026 
33.817 
33. 687 
34.326 
34. 618 
35.260 
35.107 
35.717 
30. 983 
31. 168 
30.239 
29.596 
28.534 
28.366 
27.815 
26.774 
25.997 
25.413 
24. 896 
27.409 
26. 887 
28.542 
29.262 
30.455 
30.215 
31.440 
31.608 
32.310 
29.781 
29. 638 
30.389 
30. 956 
31. 622 
29.927 
30.245 
28 . 697 
27. 634 
26.436 
26. 67b 
25.435 
604 
642 



25 
24 



512 CD2 PHE 133 



23.441 

24.877 

27.188 

26.512 

27.562 

27.195 

27.232 

26. 128 

26.278 

25.149 

25.398 

28.113 

27.815 

29.235 

30. 178 

31. 606 

32.297 

32.189 

33. 063 



54.388 
54.315 
53.226 
53.209 
55.383 
55.376 
54.290 
54.301 
55.411 
56.573 
54.554 
54.977 
56.020 
56.984 
55.825 
56.759 
56.216 
54.951 
57.188 
58.107 
59.161 
58.064 
59.280 
58.939 
59.095 
58 .719 
57.567 
59. 693 
60.020 
61. 238 
59.270 
59.831 
58.722 
60.561 
61.547 
60.066 
60. 680 
59.736. 
58.528 
57. 669 
56.546 
55. 679 
55.803 
54 . 692 
62.030 
62.801 
62.309 
63.569 
63.425 
62.535 
62.309 
61. 442 
60. 989 
64.698 
65.872 
64 .344 
65.347 
64.786 
64.597 
63.396 
65. 625 



66.782 
68.071 
68.934 
70.171 
68.476 
69.710 
70.552 
71.783 
65.733 
65.383 
65.038 
63.806 
64.099 
63.351 
65.199 
65.607 
66.826 
66.487 
67.244 
65.958 
65.599 
66.655 
67.038 
67.938 
69.432 
70.262 
70. 666 
70.504 
65.810 
65. 698 
64.396 
63.673 
62.858 
62.815 
62. 156 
62.817 
62.036 
61 . 952 
61.066 
60.979 
60.062 
59.766 
60.317 
58.910 
62.599 
61.913 
63.844 
64.479 
66.002 
66.553 
68 . 048 
68.580 
69.977 
64.036 
64.255 
63.417 
62.928 
62.822 
64.140 
64 .833 
64. 686 



1.00 18.37 
1.00 20.45 
1.00 20.37 
1.00 21.90 
1.00 20.13 
1.00 21.70 
1.00 21.90 
1.00 22.74 



1 

1 

1 

1 

1 

1 

1 

1 

1 

1, 

1. 

1. 

1. 

1. 

1. 



.00 16.79 
.00 17.06 
.00 17.21 
.00 16.33 
.00 17.87 
00 17.27 
00 17.84 
00 19.72 
00 19.70 
00 21.12 
00 20.40 
00 20.53 
00 20. 68 
00 21.83 
00 21.59 



1.00 26.47 
1.00 31.89 
1.00 34.85 
1.00 35.57 
1.00 36.98 
1.00 20.77 
1.00 19.33 
1.00 18.52 
1.00 17.79 
1.00 17.56 
1.00 16.56 
1.00 16.12 
1. 00 17 . 47 
1.00 19.03 
1.00 18.10 
1.00 18.60 
1 
1 
1 
1. 
1. 



00 20 
00 17. 



30 
51 

00 19.71 
00 17.67 
00 19.33 
1.00 21.80 
1.00 20.73 
1.00 22.02 
1.00 24.28 



.00 25.70 
.00 31.41 
.00 33.41 
.00 36.75 
00 38.85 
00 24.23 
00 24.28 
00 25.03 
00 26.68 
00 26.18 
00 26.01 



1.00 26.90 
1.00 27.15 



10 



15 



20 



25 



30 



35 



40 



'45 



50 



55 



60 



ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
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ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
' ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
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ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 



513 
514 
515 
516 
517 
518 
519 
520 
521 
522 
523 
524 
525 
526 * 
527 
528 
529 
530 
531 
532 
533 
534 
535 
536 
537 
538 
539 
540 
541 
542 
543 
544 
545 
546 
547 
548 
549 
.550 
551 
552 
553 
554 
555 
556 
557 
558 
559 
560 
561 
562 
563 
564 
565 
566 
567 
568 
569 
570 
571 
572 
573 



CE1 PHE 
CE2 PHE 
CZ PHE 



PHE 
PHE 
ASP 



CA ASP 
CB ASP 
CG ASP 
OD1 ASP 
OD2 ASP 
C ASP 



O 
N 

CA 
CB 



ASP 
VAXi 
VAL 
VAL 



CGI 


VAL 


CG2 


VAL 


c 


VAL 


o 


VAL 


N 


ASN 


CA 


ASN 


CB 


ASN 


CG 


ASN 


OD1 


ASN 


ND2 


ASN 


C 


ASN 


O 


ASN 


N 


GLN 


CA 


GLN 


CB 


GLN 


CG 


GLN 


CD 


GLN 


OE1 


GLN 


-NE2 


GLN 


C 


GLN 


O 


GLN 


N 


LEU 


CA 


LEU 


CB 


LEU 


CG 


LEU 


CD1 


LEU 


CD2 


LEU 


C 


LEU 


O 


LEU 


N 


GLN 



CA 
CB 
CG 
CD 



GLN 
GLN 
GLN 
GLN 



OE1 GLN 
NE2 GLN 
C GLN 



O GLN 
N ASN 
CA ASN 



CB 
CG 



ASN 
ASN 



OD1 ASN 
ND2 ASN 
C ASN 



133 

133 

133 

133 

133 

134 

134 

134 

134 

134 

134 

134 

134 

135 

135 

135 

135 

135 

135 

135 

136 

136 

136 

136 

136 

136 

136 

136 

137 

137 

137 

137 

137 

137 

137 

137 

137 

138 

138 

138 

138 

138 

138 

138 

138 

139 

139 

139 

139 

139 

139 

139 

139 

139 

140 

140 

140 

140 

140 

140 

140 



183 



32.837 
33.714 
33.602 
29.766 
29.350 
29.876 
29.561 
29.048 
28.560 
29.035 
27.710 
30.896 
31.721 
31.110 
32.353 
32.308 
33.587 
32. 113 
32. 660 
33.824 
31. 615 
31.790 
30.430 
29.697 
30.246 
28 .449 
32.555 
33.238 
. 32.445 
33.114 
32.192 
30 .886 
31.108 
31.777 
30.540. 
34.471 
35. 136 
34 . 886 
36.162 
36.140 
36.216 
35.438 
35. 678 
37.330 
37.350 
38.310 
39.476 
40.141 
40.513 
39.295 
38 .423 
39.230 
40.516 
40. 868 
41.016 
42.018 
42.428 
43. 640 
43. 676 
• 44.643 
41.443 



63.218 
65.461 
64 .256 
65.801 
64 . 989 
67 .099 
67. 611 
69.051 
69.538 
69.008 
70.450 
67.577 
68.478 
66.522 
66.354 
65.059 
64 . 904 
63.864 
67 .549 
67 .884 
68 .194 
69.346 
69.871 
68.881 
68.397 
68.575 
70.478 
71.251 
70.568 
71. 628 
72.122 
72.738 
73.973 
74. 922 
73. 970 
71.237 
72.057 
69. 992 
69.516 
67.991 
67.413 
68.279 
65. 994 
69. 980 
69. 764 
70.612 
71.144 
72.224 
73. 479 
74.262 
73.729 
75.535 
70.094 
69.970 
69.353 
68 . 320 
67 . 659 
66.763 
65 . 880 
66. 982 
67.273 



66.057 
65.909 
66.595 
61.540 
60.714 
61.281 
59. 957 
60. 007 
58. 648 
57. 620 
58. 602 
59.231 
59.381 
58. 457 
57 . 722 
56.880 
56.083 
57 .794 
56.820 
56.599 
56.310 
55.432 
54.961 
54.079 
53.089 
54 . 433 
56. 103 
55.436 
57.424 
58.171 
59. 290 
58.797 
57 . 939 
58.355 
56.736 
58.756 
59.389 
58.539 
59.061 
59. 166 
60.585 
61.565 
60.57 6 
58.198 
56.987 
58.837 
58.140 
58.995 
58.225 
57.761 
57.070 
58.138 
57.755 
56.583 
58.739 
58.486 
59.807 
59. 660 
58.801 
60.504 
57 .529 



1.00 28.34 
1.00 26.84 
1.00 27.99 
1.00 27.95 
1.00 28.75 
1.00 29.55 
1.00 30.26 
1.00 32.19 



1.00 32.97 
1.00 33.76 
1.00 35.71 
1.00 30.16 
1.00 30.39 
1.00 31.36 
1.00 31.74 
1.00 32.36 
1.00 33.06 
1.00 31.68 
1.00 32.14 

1.00 32.77 

1.00 32.14 

l'.OO 34.09 

1.00 34.55 

1. 

1. 

1 

1 

1 

1 

1 

1 

1 

1 

1.00 49.72 
1.00 49.42 
1.00 36.89 
1.00 36.28 
1.00 36.72 



00 36.54 
00 34.66 
00 35.78 
00 34 .86 
.00 34.66 
.00 36.37 
.00 37.67 
.00 40.06 
.00 43.99 
.00 47.20 



00 



35.87 
00 36.10 
00 36.93 
00 36.66 
.00 37.81 
,00 36.49 
.00 37.24 
.00 36.86 
.00 37.44 
.00 39.57 
.00 43.68 
.00 46.04 
1.00 48.18 
1.00 46.25 



00 36.44 
00 37.59 
00 35.34 
00 34.29 
00 35.01 
00 36.54 
1.00 37.38 
1.00 38.59 
1.00 33.04 
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31.084 10.792 49.428 1.00 31.98 
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36.285 16.133 47.508 1.00 20.57 

36.157 17.502 47.976 1.00 21.68 

37.052 17.737 49.193 1.00 22.65 

36.635 16.951 50.422 1.00 27.39 

37.541 17.213 51.609 1.00 28.93 

38.750 16.992 51.543 1.00 31.20 

36.958 17.685 52.703 1.00 31.79 

36.629 18.361 46.804 1.00 20.53 

37.304 17.857 45.902 1.00 17.89 

36.279 19.646 46.814 1.00 18.80 

36.683 20.538 45.736 1.00 17.83 

36.197 21.964 46.004 1.00 17.41 

36.569 22.988 44.926 1.00 15. 8? 

35.877 22.630 43.615 1.0.0 13.76 

36.164 24.377 45.378 1.00 17. ?8 

38.199 20.540 45.568 1.00 19.20 

38.713 20.284 44.483 1.00 19.52 

38.914 20.837 46.644 1.00 20.67 

40.366 20.860 46.590 1.00 24.08 

40.394 22.208 47.099 1.00 25.06 

40.261 23.400 46.387 1.00 30.04 

41.009 24.704 46.605 1.00 32.48 

41.103 25.165 47.764 1.00 34.10 

41.504 25.270 45.606 1.00 34.77 

40.916 19.718 47.441 1.00 24.65 

40.514 19.542 48.590 1.00 24.48 

41.831 18.914 46.877 1.00 24.86 

42.651 17.967 47.659 1.00 24.60 

42.361 19.042 45.516 1.00 23.62 

43.821 18.693 45.705 1.00 24 03 

43.704 17.508 46.641 1.00 24.48 

41.694 18.072 44.539 1.00 22.86 

41.957 18.116 43.337 1.00 21.75 

40.840 17.201 45.067 1.00 21.42 

40.176 16.177 44.267 1.00 22.19 

39.183 15.395 45.130 1.00 21.92 
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40.997 14.287 46.197 1.00 27.92 
39.208 14.687 47.402 1.00 30.18 
39.488 16.623 42.985 1.00 21.21 
39.953 16.301 41.893 1.00 23.11 
38.383 17.351 43.107 1.00 19.29 
37.662 17.793 41.921 1.00 18.52 
36.360 18.487 42.324 1.00 16.88 
35.342 17.535 42.964 1.00 19.05 
34.224 18.325 43.620 1.00 19.51 
34.793 16.583 41.901 1.00 19.00 
38.519 18.708 41.061 1.00 17.36 
38.435 18.675 39.837 1.00 16.52 
39.346 19.517 41.713 1.00 17.60 
40.231 20.440 41.016 1.00 17.78 
41.020 21.316 42.023 1.00 18.34 
40.137 22.295 42.592 1.00 22.95 
42.172 22.032 41.334 1.00 22.61 
41.205 19.679 40.120 1.00 18.14 
41.460 20.089 38.987 1.00 15.11 
41.747 18.575 40.633 1.00 18.47 
42.679 17.760 39.860 1.00 19.73 
43.298 16.648 40.721 1.00 24.67 
44.208 17.185 41.815 1.00 30.01 
44.853 18.223 41.655 1.00 32.67 
44.282 16.460 42.930 1.00 33.72 
41.968 17.128 38.669 1.00 16.49 ■ 
42.514 17.091 37.569 1.00 17.40 
40.754 16.626 38.889 1.00 15.82 
39.988 16.013 37.806 1.00 14.87 
38.605 15.504 38.290 1.00 15.42 
37.766 15.069 37.091 1.00 15.86 
38.776 14.317 39.255 1.00 14.36 
39 771 17.013 36.666 1.00 14.32 
39.965 16.688 35.491 1.00 11.80 
39.366 18.231 37.016 1.00 14.83 
39.123 19.259 36.008 1.00 14.38 
38 524 20.517 36.650 1.00 12.52 
37.128 20.328 37.23.3 1.00 12.28 
35.935 19.690 36.045 1.00 14.88 
35.797 21.097 34.903 1.00 12.30 
40.405 19.632 35.274 1.00 13.50 
40.375 19.989 34.099 1.00 13.68 
41.533 19.535 35.966 1.00 12.51 
42.814 19.891 35.372 1.00 14.22 
43.762 20.372 36.465 1.00 12.68 
43.501 18.794 34.556 1.00 14.13 
44.239 19.092 33.619 1.00 13.71 
43.246 17.534 34.889 1.00 13.05 
43.932 16.442 34.211 1.00 14.28 
44.677 15.579 35.243 1.00 13.91 
43.734 15.052 36.182 1.00 16.38 
45.702 16.412 35.995 1.00 14.07 
43.105 15.512 33.329 1.00 14.85 
43.631 14.920 32.382 1.00 13.44 
41.824 15.356 33.637 1.00 14.42 
40.993 14.482 32.821 1.00 15.37 
39.620 14.296 33.458 1.00 15.43 
38.771 13.571 32.585 1.00 16.22 
40.817 15.043 31.410 1.00 17.26 
40.718 16.258 31.216 1.00 15.56 
40.788 14.143 30.431 1.00 17.39 
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31.554 20.212 35.099 1.00 13.04 
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20.853 52.155 55.941 1.00 15.52 
20.731 51.075 57.025 1.00 14.97 
19.324 50.472 57.150 1.00 16.48 
19.332 49.281 58.102 1.00 14.35 
18.361 51.552 57.644 1.00 17.43 
22.159 52.927 56.096 1.00 15.83 
22.331 53.695 57.047 1.00 15.43 
23.074 52.729 55.152 1.00 14.16 
24.350 53.426 55.185 1-00 13.59 
25.500 52.423 55.067 1.00 12.71 
25.778 51.711 56.384 1.00 15.20 
26.069 52.354 57.395 1.00 14.44 
25.678 50.385 56.382 1.00 13.74 
24.463 54.516 54.120 1.00 13.25 
25.540 55.063 53.894 1.00 13.69 
23.346 54.820 53.462 1.00 13.09 
23.341 55.876 52.462 1.00 14.67 
23.442 55.495 50.997 1.00 12.85 
23.392 56.375 50.137 1.00 13.08, 
23.595 54.207 50.701 1.00 12.73 
23.697 53.752 49.312 1.00 12.34 
24.715 52.609 49.200 1.00 10.80 
26.115 52.990 49.633 1.00 12.14 
26.589 52.660 50.901 1.00 11.88 
27.870 53.038 51.316 1.00 10.81 
26.956 53.711 48.783 1.00 9.56 
28.237 54.096 49.189 1.00 10.67 
28.684 53.757 50.455 1.00 10.80 
29.937 54.150 50.864 1.00 13.51 
22.337 53.281 48.802 1.00 12.43 
21.401 53.121 49.585 1.00 15.41 
22.215 53.064 47.493 1.00 13.45 

20.945 52.596 46.937 1.00 13.11 
20.752 53.071 45.476 1.00 15.33 
20.670 54.595 45.446 1.00 17.06 
21.894 52.589 44.604 1.00 15.86 
20.807 51.073 47.015 1.00 11.99 
19.697 50.549 47.045 1.00 13.05 
21.936 50.368 47.047 1.00 9.74 

21.946 48.907 47.163 1.00 10.63 
21.378 48.233 45.896 1.0Q 9.06 
22.202 48.520 44.644 1.00 11.98 
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ATOM 14 61 CE3 TRP 251 
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ATOM 14 64 CZ2 TRP 251 

ATOM 14 65 CZ3 TRP 251 
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23.385 48.125 44.586 1.00 10.98 
21.658 49.140 43.708 1.00 10.66 
23.373 48.424 47.444 1.00 9.60 
24.308 49.219 47.429 1.00 11.47 
23.538 47.130 47.706 1.00 9.86 
24.858 46.577 48.007 1.00 10.45 
24.738 45.084 48.336 1.00 9.21 
25.895 46.787 46.894 1.00 9.94 
27.072 47.006 47.177 1.00 10.23 
25.465 46.701 45.638 1.00 9.68 
26.388 46.894 44.527 1.00 9.78 
26.974 48.297 44.551 1.00 12.15 

28.177 48.499 44.354 1.00 11.20 
26.103 49.271 44.790 1.00 12.09 
26.494 50.674 44.873 1.00 12.50 
25.246 51.506 45.210 1.00 12.82 
25.528 52.995 45.337 1.00 13.95 
26.395 53.521 44.610 1.00 13.98 
24.851 53.647 46.159 1.00 13.82 
27.562 50.795 45.961 1.0012.92 
28.640 51.350 45.741 1.00 11.95 
27.255 50.251 47.133 1.00 13.21 

28.178 50.277 48.259 1.00 13.25 
27.585 49.479 49.424 1.00 14.69 
28.391 49.595 50.583 1.00 18.32 
29.557 49.705 47.876 1.00 12.60 
30.587 50.324 48.143 1.00 11.74 
29.574 48.530 47.251 1.00 10.27 
30.840 47.908 46.847 1.00 11.25 
30.587 46.525 46.236 1.00 10.10 
30.163 45.479 47.223 1.00 10.89 
29.970 44.084 46.957 1.00 10.95 
29.587 43.475 48.174 1.00 12.43 
30.086 43.288 45.808 1.00 9.38 
29.895 45.657 48.554 1.00 10.57 
29.549 44.457 49.129 1.00 12.56 
29.317 42.101 48.276 1.00 11.07 
29.817 41.923 45.909 1.00 10.48 
29.437 41.345 47.135 1.00 9.03 
31.641 48.751 45.852 1.00 10.66 
32.847 48.926 46.009 1.00 11.21 

30.972 49.266 44.824 1.00 10.93 
31.654 50.075 43.824 1.00 10.15 
30.691 50.462 42.686 1.00 10.20 
30.225 49.288 41.810 1.00 9.68 
29.435 49.763 40.580 1.00 10.48 
28.195 50.475 40.912 1.00 10.26 
27.039 49.892 41.215 1.00 12.09 

25.973 50.633 41.506 1.00 10.42 
26.940 48.569 41.217 1.00 10.98 
32.271 51.337 44.440 1.00 11.46 
33.296 51.821 43.969 1.00 10.38 
31.657 51.852 45.505 1.00 13.01 
32.147 53.065 46.157 1.00 13.62 
31.224 53.465 47.309 1.00 12.96 
31.473 52.676 48.460 1.00 16.21 
33.575 52.932 46.685 1.00 12.79 
34.253 53.935 46.893 1.00 12.73 
34.025 51.699 46.903 1.00 14.11 
35.378 51.462 47.406 1.00 14.12 
35.622 49.960 47.620 1.00 16.31 
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ATOM 1550 C GLU 261 

ATOM 1551 O GLU 261 

ATOM 1552 N GLN 262 

ATOM 1553 CA GLN 262 

5 ATOM 1554 CB GLN 262 

ATOM 1555 CG GLN 262 

ATOM 1556 CD GLN 262 

ATOM 1557 OE1 GLN 262 

ATOM 1558 NE2 GLN 262 

10 ATOM 1559 C GLN 262 

ATOM 15 60 O GLN 262 

ATOM 1561 N ASP 2 63 

ATOM 1562 CA ASP 263 

ATOM 1563 CB ASP 263 

15 ATOM 1564 CG ASP 263 

ATOM 1565 OD1 ASP 263 

ATOM 1566 OD2 ASP 263 

ATOM 1567 C ASP 263 

ATOM 1568 O ASP 263 

20 ATOM 1569 N LEU 264 

ATOM 157 0 CA LEU 264 

ATOM 1571 CB LEU - 2 64 

ATOM 157 2 CG LEU 2 64 

ATOM 157 3 CD1 LEU 264 

25 ATOM 157 4 CD2 LEU 264 

ATOM 1575 C LEU 264 

ATOM 157 6 O LEU 264 

ATOM 1577 N GLU 265 

ATOM 157 8 CA GLU 2 65 

30 ATOM 157 9 CB GLU 265 

ATOM 1580 CG GLU 265 

ATOM 1581 CD GLU 265 

ATOM 1582 OE1 GLU 265 

ATOM 1583 OE2 GLU 265 

35 ATOM 158 4 C GLU 265 

ATOM 1585 O GLU 265 

ATOM- 158 6 N ARG .2 66 

ATOM 1587 CA ARG 266 

ATOM 158 8 CB ARG 2 66 

40 ATOM 158 9 CG ARG 266 

ATOM 1590 CD ARG 266 

ATOM 1591 NE ARG 266 

ATOM 1592 CZ ARG 266 

ATOM 15 93 NH1 ARG 2 66 

45 ATOM 1594 NH2 ARG 266 

ATOM 1595 C ARG 266 

ATOM 1596 O ARG 266 

ATOM 1597 N LEU 267 

ATOM 1598 CA LEU 267 

50 ATOM 159 9 CB LEU 2 67 

ATOM 1600 CG LEU 267 

ATOM 1601 CD1 LEU 267 

ATOM 1602 CD2 LEU 267 

ATOM 1603 C LEU 2 67 

55 ATOM 1604 O LEU 267 

ATOM 1605 N PHE 268 

ATOM 160 6 CA PHE 268 

ATOM 1607 CB PHE 2 68 

ATOM 1608 CG PHE 268 

60 ATOM 1609 CD1 PHE 268 

ATOM 1610 CD2 PHE 268 
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31.923 51.495 33.740 1.00 13.12 

32.193 50.538 33.011 1.00 11.67 

31.988 52.752 33.320 1.00 11.67 

32.395 53.042 31.952 1.00 13.82 

32.128 54.508 31.607 1.00 15.74 

30.659 54.915 31.661 1.00 24.70 

30.455 56.378 31.298 1.00 29.02 

30.558 56.762 30.132 1.00 35.05 

30.183 57.205 32.300 1.00 32.50 

33.883 52.733 31.760 1.00 12.94 

34.285 52.213 30.715 1.00 11.76 

34.696 53.059 32.764 1.00 11.79 

36.131 52.805 32.689 1.00 12.07 

36.845 53.238 33.975 1.00 12.41 

36.890 54.742 34.145 1.00 13.78 

36.821 55.453 33.127 1.00 14.89 

37.015 55.211 35.296 1.00 15.49 

36.410 51.328 32.469 1.00 13.01 

37.190 50.957 31.593 1.00 11.20 

35.773 50.495 33.284 1.00 12.13 

35.958 49.055 33.207 1.00 13.22 

35.259 48.379 34.393 1.00 12.89 

35.758 48.893 35.750 1.00 17.46 

35.091 48.124 36.883 1.00 15.64 

37.281 48.756 35.826 1.00 18.10 

35.449 48.498 31.884 1.00 13.15 

36.049 47.591 31.318 1.00 12.41 

34.348 49.051 31.387 1.00 15.01 

33.788 48.612 30.114 1.00 18.16 

32.457 49.326 29.856 1.00 20.13 

31.833 49.068 28.485 1.00 26.44 

31.578 47.599 28.209 1.00 31.50 
31.057 46.900 29.108 1.00 33.66 
31.886 47.145 27.085 1.00 34.88 
34.773 48.889 28.966 1.00 18.71 
34.969 48.038 28.094 1.00 17.84 
35.389 50.072 28.971 1.00 16.94 

.36.355 50.425 27.932 1.00 17.44 

36.840 51.873 28.101 1.00 20.53 
35.900 52.936 27.550 1.00 22.86 
36.534 54.323 27.645 1.00 23.46 

36.579 54.824 29.016 1.00 25.90 
35.566 55.437 29.623 1.00 28.47 
34.423 55.634 28.979 1.00 29.44 
35.693 55.851 30.876 1.00 30.11 
37.554 49.486 27.986 1.00 15'.99 
38.055 49.042 26.955 1.00 15.42 
38.014 49.188 29.197 1.00 15.40 
39.154 48.299 29.372 1.00 13.76 
39.539 48.228 30.853 1.00 13.89 
40.091 49.539 31.438 1.00 13.95 
40.200 49.433 32.952 1.00 12.21 
41.462 49.849 30.815 1.00 12.02 
38.836 46.907 28.830 1.00 15.08 
39.655 46.296 28.135 1.00 13.11 
37.642 46.412 29.136 1.00 14.37 
37.233 45.098 28.658 1.00 14.85 

35.841 44.735 29.172 1.00 15.58 
35.308 43.453 28.590 1.00 19.42 
35.795 42.219 29.026 1.00 17.97 
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42.890 48.383 46.049 1.00 12.17 

40.663 48.136 46.142 1.00 12.13 

39.250 48.549 46.234 1.00 10.54 

40.789 46.677 46.220 1.00 10.70 

39.338 46.205 46.162 1.00 12.07 

38.607 47.326 46.867 1.00 11.56 

41.674 46.026 45.160 1.00 11.01 

42.343 45.031 45.441 1.00 9.92 

41.682 46.591 43.955 1.00 10.86 

42.494 46.049 42.876 1.00 11.19 

43.984 46.073 43.175 1.00 11.99 

44.710 45.135 42.830 1.00 10.62 

44.452 47.148 43.806 1.00 11.06 

45.866 47.260 44.156 1.00 11.17 

46.157 48.609 44.831 1.00 11.18 

46.364 49.745 43.864 1.00 12.13 

45.377 50.089 42.947 1.00 11.54 

47.554 50.472 43.871 1.00 12.86 

45.572 51.146 42.045 1.00 11.15 

47.755 51.526 42.977 1.00 12.18 

46.763 51.860 42.065 1.00 11.17 

.46.275 46.135 45.098 1.00 11.04 

47.326 45.519 44.926 1.00 10.51 

45.445 45.873 46.104 1.00 10.42 

45.753 44.822 47.066 1.00 11.23 

44.640 44.720 48.116 1.00 11.68 

45.144 44.607 49.523 1.00 11.91 

44.919 45.379 50.614 1.00 10.21 
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45.976 43.589 49.941 1.00 10.64 
46.237 43.736 51.228 1.00 11.94 
45.607 44.815 51.661 1.00 11.66 
45.921 43.475 46.364 1.00 10.83 
46.780 42.674 46.736 1.00 9.33 
45.106 43.241 45.339 1.00 12.01 
45.144 41.991 44.586 1.00 10.66 
43.838 41.819 43.801 1.00 11.06 
42.565 41.755 44.645 1.00 7.88 
42.614 40.643 45.677 1.00 8.56 
43.226 39.598 45.376 1.00 8.96 
42.044 40.809 46.778 1.00 8.52 
46.320 41.897 43.607 1.00 13.30 
46.815 40.807 43,323 1.00 14.12 
46.782 43.038 43.105 1.00 14.06 
47.860 43.056 42.112 1.00 14.79 
47.801 44.369 41.336 1.00 13.68 
49.299 42.808 42.575 1.00 14.18 
50.096 42.229 41.835 1.00 13.95 
49.631 43.249 43.784 1.00 13.76 
50.989 43.119 44.311 1.00 13.53 
51.035 43.500 45.810 1.00 13.26 
52.476 43.493 46.312 1.00 11.75 
50.430 44.898 46.002 1.00 11.55 
51.090 45.972 45.158 1.00 13.98 
51.671 41.757 44.116 1.00 13.95 
52.710 41.666 43.454 1.00 10.36 
51.095 40.709 44.695 1.00 12.71 
51.678 39.389 44.562 1.00 12.69 
51.785 38.914 43.125 1.00 13.85 
52.774 38.284 42.746 1.00 14.14 
50.767 39.215 42.324 1.00 12.80 
50.742 38.805 40.923 1.00 12.50 
49.398 39.181 40.282 1.00 9.49 
48.261 38.259 40.714 1.00 10.86 
48.321 37.706 41.831 1.00 9.74 
47.293 38.098 39.946 1.00 11.31 
51.885 39.421 40.125 1.00 13.90 
52.437 38.782 39.224 1 . 0-0 12.58 
52.230 40.665 40.452 1.00 13.86 
53.307 41.351 39.758 1.00 13.18 
53.520 42.778 40.313 1.00 13.78 
54.745 43.414 39.661 1.00 13.13 
52.280 43.634 40.037 1.00 11.10 
54.600 40.553 39.891 1.00 13.89 
55.263 40.274 38.897 1.00 14.12 
54.954 40.184 41.119 1.00 13.93 
56.164 39.403 41.345 1.00 14.69 
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. 855 


1. 


00 


40. 


, 07 


58. 


974 


29. 


515 


19. 


. 168 


1. 


.00 


38. 


.02 


59. 


,369 


28. 


,213 


18 


. 471 


1. 


.00 


38. 


.51 


58. 


.968 


28. 


.190 


17 


.001 


1 


.00 


39, 


.98 


59, 


,374 


26. 


. 913 


16 


.293 


1 


.00 


40 


.39 


59, 


. 651 


25 


. 908 


16 


. 982 


1 


.00 


39 


. 63 


59 


.397 * 


26 


.915 


15 


.043 


1 


.00 


40 


.40 



58.790 29.276 20.660 1.00 37.16 

57.703 28.901 21.105 1.00 34.83 

59.847 29.487 21.435 1.00 35.29 

59.740 29.305 22.872 1.00 35.03 

61.123 29.172 23.510 1.00 37.48 

61.817 27.889 23.179 1.00 40.71 

61.971 26.749 23.894 1.00 41.63 

62.435 27.665 21.967 1.00 41.83 

62.939 26.444 21.951 1.00 42.38 

62.671 25.866 23.109 1.00 42.85 

58 993 30.497 23.464 1.00 33.66 

58.305 30.361 24.474 1.00 33.56 

59.129 31.660 22.829 1.00 31.69 

58.448 32.864 23.289 1.00 31.39 

58.886 34.096 22.488 1.00 33.71 

60.219 34.644 22.939 1.00 36.85 

60.417 34.795 24.163 1.00 38.01 

61.063 34.939 22.065 1.00 40.28 

56.943 32.705 23.131 1.00 28.93 

56.182 33.000 24.054 1.00 27.31 

56.525 32.252 21.950 1.00 25.74 

55.110 32.059 21.658 1.00 21.87 

54.893 31.631 20.183 1.00 21.83 

53.398 31.481 19.887 1.00 18.55 

55.498 32.680 19.244 1.00 20.20 

54.942 34.083 19.432 1.00 18.72 

54.522 31.009 22.591 1.00 20.48 

53.405 31.171 23.081 1.00 19.85 

55.270 29.937 22.844 1.00 18.44 

54.795 28.894 23.746 1.00 19.25 

55.788 27.727 23.817 1.00 20.64 

55.621 26.738 22.670 1.00 22.25 

54.645 26.785 21.917 1.00 19.60 

56.575 25.822 22.546 1.00 20.31 

54 598 29.475 25.151 1.00 18.68 

53.601 29.183 25.816 1.00 17.02 

55.554 30.288 25.598 1.00 15.56 

55.474 30.913 26.918 1.00 15.68 

56.779 31.646 27.248 1.00 14.76 

56.688 32.527 28.465 1.00 14.35 

56.384 31.987 29.711 1.00 15.03 

56.901 33.898 28.365 1.00 13.50 

56.294 32.798 30.841 1.00 14.00 

56.813 34.720 29.488 1.00 15.93 

56.509 34.167 30.730 1.00 14.91 

54.318 31.908 26.980 1.00 14.40 

53.541 31.917 27.935 1.00 13.41 



m 



Al OM 


3258 


N 


LEU 


478 


ATOM 


3259 


CA 


LEU 


478 


ATOM 


3260 


CB 


LEU 


478 


A1UM 


3261 


CG 


LEU 


478 


Al UM 


3262 


CD1 


LEU 


478 


A1UM 


3263 


CD2 


LEU 


478 


Al UM 


3264 


C 


LEU 


478 


Al UM 


3265 


0 


LEU 


478 


ATOM 


3266 


N 


MET 


479 


A1UM 


3267 


CA 


MET 


479 


ATOM 


3268 


CB 


MET 


479 


ATOM 


3269 


CG 


MET 


479 


ATOM 


3270 


SD 


MET 


479 


Al UVd 


O O "7 1 

3271 


CE 


MET 


' 479 


ATOM 


3272 


C 


MET 


479 


ATOM 


3273 


O 


MET 


479 


ATOM 


3274 


N 


LYS 


480 


ATOM 


3275 


CA 


LYS 


480 


ATOM 


3276 


CB 


LYS 


480 


ATOM 


3277 


CG 


LYS 


480 



ATOM 327 8 CD LYS 4 80 

ATOM 327 9 CE LYS 4 80 

ATOM 32 80 NZ LYS 4 80 

ATOM 3281 C LYS 4 80 

ATOM 3282 O LYS 4 80 

ATOM 328 3 N MET 4 81 

ATOM 3284 CA MET 4 81 

ATOM 328 5 CB MET 4 81 

ATOM 328 6 CG MET 481 

ATOM 328 7 SD MET 4 81 

ATOM 3288 CE MET 4 81 

ATOM 328 9 C MET 4 81 

ATOM 3290 O MET 481 

ATOM 3291 N ALA 4 82 

ATOM 32 92 CA ALA 4 82 

ATOM 3293 CB ALA 4 82 



ATOM 


3294 


C 


*XA 


482 


ATOM 


3295 


O 


ALA 


482 


ATOM 


3296 


N 


LEU 


483 


ATOM 


3297 


CA 


LEU 


483 


ATOM 


3293 


CB 


LEU 


483 


ATOM 


3299 


CG 


LEU 


483 


ATOM 


3300 


CD1 


LEU 


483 


ATOM 


3301 


CD2 


LEU 


483 


ATOM 


3302 


C 


LEU 


483 


ATOM 


3303 


O 


LEU 


483 


ATOM 


3304 


N 


ASP 


484 


ATOM 


3305 


CA 


ASP 


484 


ATOM 


3306 


CB 


ASP 


484 


ATOM 


3307 


CG 


ASP 


484 


ATOM 


3308 


OD1 


ASP 


484 


ATOM 


3309 


OD2 


ASP 


484 


ATOM 


3310 


C 


ASP 


484 


ATOM 


3311 


O 


ASP 


484 


ATOM 


3312 


N 


LYS 


485 


ATOM 


3313 


CA 


LYS 


485 


ATOM 


3314 


CB 


LYS 


485 


ATOM 


3315 


CG 


LYS 


485 


ATOM 


3316 


CD 


LYS 


485 


ATOM 


3317 


CE 


LYS 


485 


ATOM 


3318 


NZ 


LYS 


485 




54.220 32.758 25.964 1.00 14.15 
53.155 33.750 25.917 1.00 14.48 
53.317 34.647 24.688 1.00 12.70 
54.454 35.664 24.818 1.00 13.21 
54.637 36.396 23.508 1.00 13.86 
54.137 36.649 25.946 1.00 14.40 
51.792 33.071 25.904 1.00 13.02 
50.833 33.585 26.476 1.00 11.74 
51.708 31.915 25.253 1.00 13.08 
50.448 31.183 25.199 1.00 14.20 
50.547 29.989 24.242 1.00 14.09 
49.301 29.106 24.220 1.00 1-3.18 
47.788 30.012 23.801 1.00 19.98 
48.041 30.307 22.061 1.00 13.75 
50.105 30.688 26.599 1.00 14.85 
48.958 30.778 27.035 1.00 14.35 
51.102 30.165 27.304 1.00 15.49 
50.868 29.669 28.648 1.00 17.00 
52.136 29.016 29.221 1.00 19.28 
51.911 28.409 30.600 1.00 26.30 
52.934 27.329 30.981 1.00 32.36 
54.279 27.910 31.400 1.00 35.11 
55.168 26.878 32.033 1.00 35.89 
50.395 30.804 29.553 1.00 14.63 
49.513 30.613 30.386 1.00 16.02 
50.971 31.988 29.374 1.00 13.07 
50.602 33.147 30.178 1.00 12.02 
51.612 34.282 29.976 1.00 12.55 
53.015 33.998 30.509 1.00 11.44 
53.051 33.565 32.265 1.00 18.83 
52.416 35.085 33.001 1.00 12.30 
49.202 33.644 29.821 1.00 12.65 
43.430 34.045 30.696 1.00 12.46 
48.886 33.619 28.531 1.00 10.95 
47.588 34.067 28.046 1.00 11.63 
47.596 34.125 26.530 1.00 11.09 
46.463 33.152 28.524 1.00 13.21 
45.372 33.620 28.861 1.00 10.82 
46.722 31.848 28.538 1.00 11.90 
45.712 30.897 28.978 1.00 14.01 
46.260 29.468 28.910 1.00 11.66 
46.455 28.896 27.498 1.00 10.77 
47.098 27.517 27.587 1.00 8.72 

45.118 28.813 26.785 1.00 7.73 
45.268 31.232 30.401 1.00 14.37 
44.158 30.898 30.814 1.00 14.12 
46.142 31.894 31.147 1.00 15.19 
45.810 32.281 32.511 1.00 17.15 
47.038 32.146 33.416 1.00 19.74 
46.787 32.653 34.830 1.00 26.92 
45.717 32.353 35.399 1.00 31.69 
47.667 33.344 35.383 1.00 30.93 
45.271 33.711 32.560 1.00 16.03 
44.101 33.927 32.860 1.00 17.57 

46.119 34.676 32.217 1.00 14.15 
45.772 36.094 32.265 1.00 12.86 
47.042 36.932 32.080 1.00 12.78 
48.152 36.589 33.070 1.00 13.51 
47.721 36.847 34.509 1.00 13.07 
48.838 36.502 35.488 1.00 14.28 
48.482 36.832 36.899 1.00 12.83 



ATOM 3319 C LYS 4 85 

ATOM 3320 O LYS 4 85 

ATOM 3321 N ILE 4 86 

ATOM 3322 CA ILE 4 86 

ATOM 3323 CB ILE 48 6 

ATOM 3324 CG2 ILE 486 

ATOM 3325 CGI ILE 486 

ATOM 332 6 CD1 ILE 486 

ATOM 3327 C ILE 48 6 

ATOM 3328 O ILE 48 6 

ATOM 3329 N . ALA 4 87 

ATOM 3330 CA ALA 4 87 

ATOM 3331 CB ALA 4 87 

ATOM 3332 C ALA 4 87 

ATOM 3333 O ALA 4 87 

ATOM 3334 N PHE 488 

ATOM 3335 CA PHE 488 

ATOM 3336 CB PHE 4 88 

ATOM 3337 CG PHE 488 

ATOM 3338 CD1 PHE 488 

ATOM 3339 CD2 PHE 488 

ATOM 3340 CE1 PHE 488 

ATOM 3341 CE2 PHE 4 88 

ATOM 334 2 CZ PHE 4 88 . 

ATOM 3343 C PHE 488 

ATOM 334 4 O PHE 4 88 

ATOM 334 5 N ILE 48 9 

ATOM 334 6 CA ILE 48 9 

ATOM 3347 CB ILE 48 9 

ATOM . 334 8 CG2 ILE 489 

ATOM 334 9 CGI ILE 48 9 

ATOM 335 0 CD1 ILE 48 9 

ATOM 3351 C ILE 489 

ATOM 3352 O ILE 489 

ATOM 3353 N PRO 4 90 

ATOM 335 4 CD PRO 4 90 

ATOM 3355 CA PRO 4 90 

ATOM 3356 CB PRO 4 90 

ATOM 3357 CG PRO 4 90 

ATOM 3358 C PRO 4 90 

ATOM 3359 O PRO 4 90 

ATOM 3360 N PHE 4 91 

ATOM 3361 CA PHE 4 91 

ATOM 3362 CB PHE 4 91 

ATOM 3363 CG PHE 4 91 

ATOM 3364 CD1 PHE 4 91 

ATOM 3365 CD2 PHE 4 91 

ATOM 3366 CE1 PHE 4 91 

ATOM .3367 CE2 PHE 491 

ATOM 3368 CZ PHE 4 91 
ATOM 3369 C PHE 4 91 
ATOM 3370 O PHE 4 91 
ATOM 3371 N SER 4 92 
ATOM 3372 CA SER 4 92 
ATOM 3373 CB SER 4 92 
ATOM 337 4 OG SER 4 92 
ATOM 3375 C SER 4 92 
ATOM 337 6 O SER 4 92 
ATOM 3377 N TYR 4 93 
ATOM 337 8 CA TYR 4 93 
ATOM 337 9 CB TYR 4 93 
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44.690 36.616 31.319 1.00 12.35 
43.823 37.382 31.739 1.00 12.38 
44.744 36.233 30.047 1.00 11.02 
43.753 36.707 29.084 1.00 11,30 
44.243 36.506 27.632 1.00 10.39 
43.126 36.849 26.649 1.00 9.22 
45.487 37.366 27.376 1.00 9.66 
45.269 38.863 27.555 1.00 12.52 
42.400 36.016 29.255 1.00 10.85 
41.354 36.664 29.228 1.00 9.72 
42.418 34.700 29.433 1.00 11.32 
41.176 33.954 29.610 1.00 10.28 
41.484 32.464 29.773 1.00 9.81 
40.390 34.462 30.823 1.00 10.73 
39.161 34.432 30.836 1.00 11.37 
41.110 34.937 31.834 1.00 10.32 
40.500 35.426 33.067 1.00 11.62 
41.574 35.560 34.149 1.00 11.67 
41.027 35.846 35.523 1.00 14.14 
40.310 34.873 36.222 1.00 13.10 
41.256 37.075 36.134 1.00 13.14 
39.834 35.122 37.510 1.00 13.40 
40.788 37.332 37.414 1.00 11.72 
40.076 36.355 38.107 1.00 13.66 
39.765 36.763 32,931 1.00 11.30 
38.809 37.022 33.662 1.00 10.92 
40.208 37.604 32.000 1.00 11.64 . 
39.600 38.915 31.807 1.00 10.62 
40.193 39.615 30.568 1.00 12.52 
39.378 40.870 30.215 1.00 11.83 
41.652 39.993 30.842 1.00 14.27 
41.824 41.029 31.952 1.00 13.73 
38.067 38.919 31.723 1.00 12.80 
37.406 39.590 32.514 1.00 12.29 
37.482 38.183 30.764 1.00 11.12 
38.069 37.429 29.644 1.00 10.46 
36.020 38.185 30.683 1.00 10.14 
35.742 .37.410 29.390 1.00 10.83 
36.942 36.516 29.258 1.00 12.17 
35.325 37.586 31.911 1.00 9.58 
34.257 38.049 32.306 1.00 8.62 
35.925 36.568 32.521 1.00 6.98 
35.321 35.969 33.709 1.00 9.75 
36.067 34.703 34.137 1.00 8.88 
35.512 34.074 35.393 1.00 9.15 
34.374 33.271 35.344 1.00 8.95 
36.109 34.311 36.630 1.00 7.55 
33.837 32.713 36.508 1.00 9.82 
35.580 33.757 37.803 1.00 9.83 
34.442 32.957 37.740 1.00 8.83 
35.329 36.948 34.885 1.00 9.97 
34.312 37.127 35.558 1.00 10.66 
36.475 37.579 35.132 1.00 9.34 
36.589 38.512 36.249 1.00 11.14 
38.046 38.946 36.439 1.00 8.12 
38.534 39.643 35.307 1.00 11.50 
35.692 39.738 36.083 1.00 11.76 
35.327 40.391 37.064 1.00 13.67 
35.344 40.042 34.840 1.00 11.79 
34.482 41.177 34.527 1.00 12.20 
34.633 41.544 33.048 1.00 12.22 
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ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 
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ATOM 
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ATOM 

ATOM 

ATOM 
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ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM' 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 



3380 

3381 

3382 

3383 

3384 

3385 

3386 

3387 

3388 

3389 

3390 

3391 

3392 

3393 

3394 

3395 

3J96 

3397 



CG TYR 
CD1 TYR 
CE1 TYR 
CD2 TYR 
CE2 TYR 
CZ TYR 



OH 
C 

o 

N 

CA 

CB 



TYR 
TYR 
TYR 
LEU 
LEU 
LEU 



CG LEU 
CD1 LEU 
CD2 LEU 
C LEU 



O 
N 



LEU 
VAL 



3398 CA VAL 

3399 CB VAL 

3400 CGI VAL 
34 01 CG2 VAL 
3402 C VAL 

O VAL 
N ASP 



3403 
3404 
3405 
3406 
3407 



CA ASP 
CB ASP 



CG ASP 

3408 OD1 ASP 

3409 OD2 ASP 
C ASP 
O ASP 
N GLN 
CA GLN 
CB GLN 
CG GLN 
CD GLN 



3410 
3411 
3412 
3413 
3414 
3415 
3416 
3417 
3418 
3419 
3420 
3421 
3422 
3423 
3424 
3425 



OE1 GLN 
NE2 GLN 
C GLN 



O GLN 
N TRP 
CA TRP 
CB TRP 
CG TRP 
CD2 TRP 

3426 CE2 TRP 

3427 CE3 TRP 

3428 CD1 TRP 

3429 NE1 TRP 

3430 CZ2 TRP 

3431 CZ3 TRP 
CH2 TRP 
C TRP 
O TRP 
N ARG 



3432 
3433 
3434 
3435 
3436 
3437 
3438 
3439 
3440 



CA ARG 

CB ARG 

CG ARG 

CD ARG 

NE ARG 



493 

493 

493 

493 

493 

493 

493 

493 

493 

494 

494 

494 

494 

494 

494 

494 

494 

495 

495 

495 

495 

495 

495 

495 

496 

496 

496 

496 

496 

496 

496 

496 

497 

497 

497 

497 

497 

497 

497 

497 

497 

498 

498 

498 

498 

498 

498 

498 

498 

498 

498 

4 98 

498 

498 

498 

499 

499 

499 

499 

499 

499 
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33.904 

34.295 

33. 630 

32.821 

32.150 

32.561 

31.912 

33.022 

32.254 

32.670 

31.327 

31.319 

30.282 

30.701 

30.192 

30.741 

29.566 

31.544 

31.019 

32.139 

31.549 

32.833 

30. 197 

29. 047 

30.768 

30.012 

30. 954 

31.513 

31.025 

32.426 

29.015 

28.125 

29.143 

28.150 

28.471 

29. 702 

29. 672 

28. 615 

30. 844 

26. 847 

25.774 

26. 961 

25. 812 

26.189 

25.057 

24. 815 

23. 634 

25. 480 

24. 044 

23. 185 

23. 105 

24. 950 

23. 773 

25.333 

24.146 

26.252 

25.871 

27. 038 

27.106 

27. 992 

29.395 



42. 801 
44.045 
45.210 
42. 751 
43. 912 
45.137 
46.285 
40.811 
41.585 
39. 604 
39.053 
37.739 
36. 622 
35. 590 
35.977 
38.834 
39.112 
38.347 
38.055 
37. 621 
37.318 
36.376 
39. 171 
38. 955 
40.360 
41.435 
42.439 
41.909 
40. 855 
42.532 
42. 125 
42. 837 
41. 923 
42. 511 
42.231 
42. 954 
44 . 459 
45.082 
45.048 
41.804 
42. 403 
40.520 
39.717 
38.231 
37.318 
36.788 
36.009 
36.898 
36.851 
36.063 
35.343 
36.232 
35.465 
40. 151 
40.398 
40.243 
40. 643 
40.468 
39.070 
39.005 
39.250 



32. 642 
33.134 
32.763 
31.764 
31.385 
31.889 
31.510 
34.793 
35.363 
34.372 
34.475 
33. 685 
33.731 
32. 639 
35.100 
35. 874 
36.102 
36.810 
38. 141 
39.109 
40.486 
38.564 
38.785 
39.154 
38. 919 
39. 538 
40.207 
41.526 
41. 993 
42.098 
38. 609 
39.081 
37.298 
36. 400 
34. 926 
34.377 
34 . 613 
34. 607 
34.809 
36. 787 
36.794 
37. 120 
37.537 
37.571 
37. 978 
39.287 
39.213 
40.518 
37. 182 
37. 919 
40.325 
41. 629 
41.520 
38. 928 
39.132 
39. 885 
41.240 
42.222 
42.824 
44.062 
43.748 



1.00 13.21 
1.00 14.67 
1.00 16.44 
1.00 13.82 
1.00 16.41 
1.00 15.04 
1. 00 14.56 
1. 00 13.05 
1. 00 13.30 
1.00 13.45 
1. 00 13.55 
1. 00 16.41 
1.00 20.60 
1. 00 21. 31 
1.00 15.36 



1, 
1, 
1. 
1. 
1. 



.00 11.95 
.00 11.20 
.00 10.36 
00 10.58 
,00 9.66 
1.00 9.24 
1.00 12.37 
1.00 9.62 



1. 00 
1. 00 



7.39 
9.30 



1.00 10.89 
1.00 10.57 



1. 00 
1.00 
1.00 
1. 00 



9.77 
9.79 
9.45 
11.41 



1.00 11.26 

1.00 9.61 

1.00 11.24 

1.00 11.55 

1.00 13.89 

1.00 16.12 

1.00 14.20 

1.00 17.36 

1.00 10.45 

1.00 10.85 

1.00 9.86 

1.00 10.64 

1.00 9.73 

1.00 9/08 

1.00 9.94 

1.00 11.43 
1 
1 
1. 
1. 
1. 



00 9.53 
00 10.54 
00 9.96 
00 10.04 
00 13.16 



1.00 12.54 



00 9.68 
00 10.04 
00 10.21 
00 11.86 
00 11.07 
1.00 12.83 
1.00 12.47 
1.00 10.15 



1 
1 

1, 
1. 
1. 
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ATOM 


3447 


CA 


TRP 




ATOM 
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CB 


TRP 




ATOM 


3449 


CG 


TRP 


10 


ATOM 


3450 


CD2 


TRP 




ATOM 


3451 


CE2 


TRP 




ATOM 


3452 


CE3 


TRP 




ATOM 


3453 


CD1 


TRP 




ATOM 


3454 


NE1 


TRP 


15 


ATOM 


3455 


CZ2 


TRP 




ATOM 


3456 
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TRP 




ATOM 


3457 


CH2 


TRP 
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3458 
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TRP 
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3459 
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TRP 


20 


ATOM 
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CA 


ARG 
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CB 


ARG 
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CG 


ARG 




ATOM 


3464 


CD 


ARG 


25 


ATOM 


3465 


NE 


ARG 
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ARG 
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ARG 
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3468 
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ARG 
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3469 
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ARG 
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j \j 


ATOM 


3470 


o 
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ATOM 


3471 


N 


VAL 




ATOM 


3472 


CA 


VAL 




ATOM 


3473 


CB 


VAL 




ATOM 


3474 


CGI 


VAL 


35 
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30.409 38.744 44.439- 1.00 12.88 
30.171 37.961 45.491 1.00 10.18 
31.657 39.000 44.067 1.00 10.68 
25.325 42.065 41.336 1.00 12.18 
24.395 42.311 42.098 1.00 11.36 
25.887 42.996 40.568 1.00 11.77 
25.391 44.368 40.605 1.00 12.08 
26.202 45.289 39.684 1.00 10.66 
27.602 45.521 40.139 1.00 10.40 
28.677 46.051 39.357 1.00 11.41 
29.821 46.074 40.183 1.00 11.19 
28.786 46.504 38.034 1.00 11.44 
28.119 45.261 41.374 1.00 9.78 
29.451 45.589 41.409 1.00 10.36 
31.062 46.536 39.731 1.00 10.56 
30.021 46.963 37.585 1.00 11.49 
31.143 46.972 38.434 1.00 9.70 
23.933 44.407 40.174 1.00 13.08 
23.143 45.188 40.703 1.00 12.40 
23.581 43.565 39.207 1.00 11.92 
22.209 43.525 38.719 1.00 12.19 
22.176 42.962 37.298 1.00 13.67 
22.833 43.881 36.281 .1.00 17.51 
22.956 43.199 34.929 1.00 23.14 
21.650 42.807 34.407 1.00 28.87 
21.443 41.731 33.656 1.00 30.18 
22.458 40.936 33.340 1.00 30.67 
20.222 41.448 33.223 1.00 31.24 
21.296 42.732 39.653 1.00 10.77 
20.081 42.891 39.626 1.00 10.89 
21.884 41.865 40.468 1.00 9.90 
21.100 41.120 41.439 1.00 10.52 
21.896 39.940 42.034 1.00 9.49 
21.205 39.425 43.285 1.00 8.80 
22.008 38.812 41.002 1.00 12.29 
20.786 42.126 42.551 1.00 12.15 
19.643 42.245 42.999 1.00 10.37 
21.815 42.865 42.965 1.00 11.50 
21.678 43.864 44.018 1.00 11.51 
23.057 44.434 44.386 1.00 10.31 
23.939 43.463 45.144 1.00 11.73 
25.324 43.571 45.085 1.00 11.85 
23.384 42.451 45.925 1.00 12.62 
26.146 42.693 45.787 1.00 10.68 
24.199 41.564 46.633 1.00 10.51 
25.582 41.686 46.563 1.00 12.31 
20.711 44.995 43.661. 1.00 10.96 
19.896 45.384 44.497 1.00 9.00 
20.779 45.519 42.435 1.00 12.29 
19.866 46.599 42.058 1.00 12.07 
20.451 47.479 40.931 1.00 13.57 
20.413 46.822 39.548 1.00 13.98 
19.775 45.766 39.358 1.00 14.51 
21.033 47.396 38.633 1.00 15.18 
18.470 46.107 41.685 1.00 13.54 
17.615 46.892 41.278 1.00 13.18 
18.239 44.805 41.833 1.00 14.58 
16.929 44.244 41.540 1.00 15.30 
16.590 43.920 40.095 1.00 16.54 
15.457 43.528 39.808 1.00 17.31 
17.547 44.088 39.185 1.00 15.04 
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21.672 28.859 36.980 1.00 11.83 
22.613 29.054 37.743 1.00 14.77 
21.644 27.848 36.118 1.00 7.41 
20.872 31.572 35.299 1.00 12.10 
21.918 31.536 34.637 1.00 10.25 
19.690 31.882 34.775 1.00 11.72 
19.568 32.194 33.362 1.00 14.72 
18.089 32.356 32.979 1.00 17.46 
17.260 31.066 33.144 1.00 23.70 
16.576 30.934 34.510 1.00 25.81 
17.121 31.322 35.552 1.00 21.12 
15.375 30.363 34.502 1.00 29.42 
20.369 33.446 32.994 1.00 14.10 
20.973 33.511 31.918 1.00 13.15 
20.391 34.432 33.887 1.00 13.07 
21.136 35.660 33.615 1.00 12.06 
20.656 36.802 34.519 1.00 16.07 
19.244 37.290 34.190 1.00 18.59 
19.084 37.659 32.719 1.00 24.12 
19.829 38.544 32.235 1.00 24.75 
18.214 37.060 32.046 1.00 26.20 
22.626 35.420 33.805 1.00 10.94 
23.454 36.089 33.190 1.00 12.49 

22.970 34.466 34.660 1.00 9.41 
24.374 34.135 34.873 1.00 9.43 

24.513 33.135 36.030 1.00 9.02 
25.865 32.477 36.148 1.00 10.43 
27.026 33.004 36.804 1.00 10.48 
28.037 32.019 36.718 1.00 10.91 
27.311 34.210 37.458 1.00 10.58 
26.212 31.235 35.697 1.00 11.18 

27.514 30.951 36.038 1.00 12.53 
29.314 32.201 37.263 1.00 11.44 
28.583 34.393 38.002 1.00 11.75 
29.568 33.390 37.899 1.00 11.65 
24.912 33.542 33.575 1.00 9.00 

25.971 33.943 33.088 1.00 10.05 
24.167 32.605 32.993 1.00 9.12 
24.609 31.995 31.746 1.00 9.56 
23.855 30.689 31.497 1.00 8.95 
24.458 29.614 32.334 1.00 7.44 
25.729 28.992 32.118 1.00 8.27 
25.994 28.176 33.240 1.00 7.79 
26.678 29.052 31.084 1.00 9.26 
24.007 29.154 33.540 1.00 6.42 
24.927 28.292 34.093 1.00 8.16 
27.166 27.424 33.359 1.00 9.24 
27.848 28.301 31.204 1.00 10.00 
28.080 27.500 32.333 1.00 8.75 
24.549 32.927 30.538 1.00 10.22 
25.321 32.769 29.593 1.00 10.24 
23.654 33.908 30.566 1.00 11.02 
23.588 34.863 29.467 1.00 13.01 
22.407 35.823 29.631 1.00 14.72 
21.187 35.175 29.332 1.00 22.10 
24.887 35.663 29.462 1.00 11.80 
25.375 36.072 28.405 1.00 10.96 
25.441 35.889 30.650 1.00 10.02 
26.685 36.638 30.763 1.00 11.52 
26.828 37.219 32.173 1.00 11.07 
25.804 38.316 32.491 1.00 13.05 
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43.926 57.257 47.987- 1.00 21.14 
43.486 59.283 47.141 1.00 21.16 
43.484 60.757 47.074 1.00 20.84 
42 742 58.700 46.016 1.00 20.12 
42.380 59.920 45.171 1.00 21.00 
42.288 61.033 46.199 1.00 22.41 
43.717 57.765 45.294 1.00 18.21 
44.879 58.123 45.108 1.00 16.62 
43.256 56.588 44.871 1.00 16.20 
44.155 55.631 44.222 1.00 14.88 
43.404 54.352 43.807 1.00 11.66 
42.419 54.500 42.687 1.00 11.73 
42.712 54.543 41.285 1.00 10.38 
41.478 54.643 ■ 40.605 1.00 9.76 
43 897 54.513 40.537 1.00 11.00 
41.059 54.575 42.797 1.00 11.94 
40.487 54.657 41.551 1.00 12.01 
41.393 54.706 39.211 1.00 10.56 
43.813 54.577 39.147 1.00 12.40 
42.569 54.675 38.500 1.00 10.93 
44.990 56.138 43.048 1.00 14.01 
46.088 55.639 42.819 1.00 14.77 
44.491 57.132 42.287 1.00 14.29 
43.145 57.734 42.275 1.00 12.19 
45.286 57.629 41.158 1.00 15.72 
44.432 58.770 40.613 1.00 15.01 
43.046 58.272 40.859 1.00 14.40 
46.687 58.091 41.559 1.00 17.03 
47 600 58.098 40.740 1.00 16.15 
46.852 58.481 42.818 1.00 17.28 
48.157 58.925 43.298 1.00 19.19 
48.017 59.648 44.639 1-00 21.09 
47 456 61.060 44.503 1-00 25.25 
47.359 61.785 45.837 1.00 27.88 
48.311 61.683 46.641 1.00 28.02 
46.334 62.462 46.075 1.00 29.69 
49.106 57.739 43.430 1.00 17.57 
50.268 57.828 43.052 1.00 18.96. 
48.611 56.626 43.962 1.00 16.29 
49 438 55.434 44.094 1-00 16.79 
48.693 54.353 44.871 1.00 14.91 
49.775 54.947 42.683 1.00 16.09 
50.886 54.488 42.425 1.00 16.14 
48.810 55.056 41.772 1.00 15.85 
49.024 54.649 40.383 1.00 15.49 
47.742 54.841 39.569 1.00 14.89 
47 885 54.548 38.068 1.00 12.61 
48.373 52.846 37.686 1.00 14.89 
46.752 52.007 37.746 1.00 9.67 
50.152 55.486 39.769 1.00 17.04 
51.018 54.963 39.063 1-00 16.25 
50.131 56.787 40.044 1.00 16.65 

51 149 57.697 39.527 1.00 17.56 
50.798 59.148 39.865 1.00 21.08 
51.631 60.167 39.093 1.00 24.06 
50 775 61.060 38.219 1.00 27.95 
50 227 62.066 38.675 1.00 30.67 
50 637 60.684 36.959 1.00 31.02 

52 521 57.362 40.103 1.00 18.21 
53*.513 57.324 39.375 1.00 17.58 
52 574 57.120 41.410 1.00 17.37 
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GLN 
GLN 
GLN 
GLN 
PRO 
PRO 
PRO 
PRO 
PRO 
PRO 
PRO 
ASN 
ASN 
ASN 
ASN 
£SN 
ASN 
ASN 
ASN 
MET 
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53.831 
53.593 
53.385 
53.020 
54.660 
54.517 
55.743 
53.728 
54.289 
53.356 
53. 944 
53.158 
52.075 
54.562 
55. 643 
53.586 
53. 684 
52.368 
51.341 
51. 941 
54 . 035 
54.323 
54.000 
54.315 
53. 126 
53.256 
51. 965 
50.736 
50.551 
50.422 
49.041 
48 . 045 
48.966 
49. 622 
49.815 
48 . 443 
48 . 129 
47.357 
46. 829 
46. 800 
46.202 
45.390 
46. 108 
44.985 
44.495 
44 .216 
44 .489 
43.658 
45.226 
46.364 
44 . 124 
44 . 179 
42.868 
42. 667 
41. 194 
39. 927 
44.316 
43.788 
45.013 
45. 194 
46. 309 



56.775 
56.498 
57.741 
57.324 
58.580 
55.567 
55.524 
54 .585 
53.384 
52. 167 
50. 944 
51.888 
50.365 
53.505 
53.151 
54.018 
54.108 
53.622 
54.594 
52.289 
55.452 
55.502 
56.538 
57.836 
58. 496 
59.556 
57.854 
58. 391 
57.895 
56.397 
55. 919 
56.448 
54 . 919 
57.957 
57.042 
58.599 
59.856 
58.2*8 
59. 617 
60.287 
57 .320 
56. 988 
56.904 
56.072 
56.545 
58 .029 
58.692 
58.557 
54 .573 
54 .105 
53.829 
52.387 
51.709 
51.565 
50.589 
51.842 
52.322 
53.187 
51.318 
51.215 
52.168 



42.069 
43.560 
44 . 430 
45.839 
44.431 
41.434 
41.319 
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40. 609 
39.917 
42.099 
42.391 
38. 907 
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36. 079 
36.320 
36.701 
36.059 
34.860 
36. 826 
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35.572 
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35.288 
36.917 
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35. 933 
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37. 409 
35.289 
34. 869 
33.501 
33. 462 
32. 463 
34.542 
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19.20 
20.05 
22. 61 
22.10 
21.77 
18. 06 
17.10 
17.78 
15.89 
17.36 
17.31 
17. 99 
16. 14 
16. 86 
18.03 
16.36 
15.96 
14.90 
12.04 
12. 12 
17.25 
16.91 
16. 88 
16.24 
17.22 
16.05 
17.32 
15. 68 
15.80 
16.06 

14. 47 
14.28 
12. 42 
14 . 75 
14 . 61 
14.37 
15.57 
14 . 41 

15. 41 
14.41 
14.55 
13.74 
14.76 
16.09 
17.02 
18.18 
22. 94 
18. 54 
14.02 
13.88 
13.32 
14. 06 

11. 91 
12.29 
13.39 
14 . 36 
14.48 
13.59 

12. 87 
13.47 
13.34 
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46.562 
45.540' 
46.338 
44 . 937 
45.177 
43. 968 
46. 433 
46.845 
47.036 
48.239 
48.788 
47.972 
49.361 
49. 919 
49. 697 
50.761 
50.945 
50.476 
51.344 
49.261 
48 . 882 
47.501 
47 .048 
45 . 490 
45.878 
48.892 
49.352 
48.397 
48.392 
47.622 
4 6.108 
45 . 483 
45. 496 
49.824 
50.141 
50.688 
52.088 
52.788 
54.196 
52.777 
53.544 
52. 490 
53.055 
52.487 
53.024 
54.061 
54.538 
52. 481 
52.951 
53.975 
54.425 
52.721 
53.585 
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51.021 
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40.679 
40.733 
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4665 

4666 
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4671 
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4678 

4679 
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4685 

4686 

4687 

4688 

4689 

4690 

4691 

4692 
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4695 
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4702 
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4710 
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OH2 WAT 
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OH2 WAT 
OH2 WAT 
OH2 WAT 
OH 2 WAT 
OH2 WAT 
OH2 WAT 
OH2 WAT 
OH2 WAT 
OH 2 WAT 
OH2 WAT 
OH 2 WAT 
OH2 WAT 
OH2 WAT 
OH2 WAT 
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OH2 WAT 
OH2 WAT 
OH 2 WAT 
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OH2 WAT 
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OH2 WAT 
OH 2 WAT 
OH2 WAT 
OH2 WAT 
OH 2 WAT 
OH2 WAT 
OH 2 WAT 
OH2 WAT 
OH2 WAT 



705 
706 
707 
708 
709 
710 
711 
712 
713 
714 
715 
.716 
717 
718 
719 
720 
721 
722 
723 
724 
725 
726 
727 
728 
729 
* 730 
731 
732 
734 
735 
736 
737 
738 
739 
740 
741 
742 
743 
744 
745 
746 
747 
748 
749 
750 
751 
752 
753 
754 
755 
756 
757 
758 
759 
760 
761 
762 
763 
764 
765 
766 



33.593 

32.965 

24.287 

41. 492 

57.508 

39.412 

51.091 

53.042 

52.228 

16.492 

47 . 496 

33.685 

33.197 

37 . 259 

17 .570 

40.862 

37 . 202 

48.308 

44.174 

37.635 

31. 657 

48.823 

43.137 

36.288 

45.181 

19.065 

18.532 

16.291 

27. 683 

43.137 

23. 024 

40. 667 

44.387 

42. 945 

36.456 

39. 950 

47. 924 

48.523 

16.710 

50.722 

42.905 

43.215 

37 .286 

31.745 

50.872 

23.924 

46.065 

17.359 

30.402 

30.166 

40.401 

48.250 

23.481 

39.537 

41.465 

50.211 

55.270 

17.787 

41.349 

31.365 

35.820 



46. 605 
26.053 
47 .413 
22. 684 
41.508 
23.942 
36.475 
51.060 
34.436 
44 .739 
46.775 
54.185 
41.542 
27.514 
40.507 
49. 469 
32.888 
52.095 
25.251 
28.226 
44.896 
40.553 
28.430 
33.431 
39.323 
49. 680 
48.145 
39.074 
37.166 
42.880 
50.329 
28 .535 
45.117 
25.383 
23.986 
39.788 
45.637 
42.505 
44.022 
52.452 
51.202 
36.931 
29.700 
23.237 
32.438 
24.786 
55.788 
38.412 
54.271 
28.862 
56.296 
46.266 
22.074 
29.868 
55.169 
32.756 
39.776 
47. 082 
43.344 
48.033 
29.185 



44.423 
34.132 
42.078 
37 . 822 
37. 660 
39.181 
38.087 
33.144 
36.844 
50.912 
31.239 
42. 672 
38.210 
46.151 
43.225 
48. 991 
29.681 
48.720 
17.566 
16.249 
43.119 
46.220 
14 . 664 
19.194 
41.443 
43.390 
47.790 
47.581 
46. 929 
22.592 
41.657 
40.057 
56. 435 
33.384 
32.241 
48 . 186 
61. 414 
62. 565 
47.985 
33.013 
39.594 
50.988 
43.929 
13.527 
13.257 
40. 648 
48.966 
45.074 
37.540 
39.171 
45. 617 
58. 848 
39.456 
42.153 
34.794 
35.889 
52.027 
45.179 
47.624 
33.699 
13.398 



1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 



00 
00 
00 
00 
00 
00 
00 
00 
00 
1.00 
1.00 
1.00 
1. 00 
1. 00 
1.00 



00 
00 
00 
00 
00 
00 
00 
00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 



9.31 
8.96 
8.51 
11.40 
13.76 
8.91 
10.07 
16.13 
10.52 
12.73 
10.24 
16.09 
12.21 
11.50. 
12.99 
9.86 
9.25 
13.18 
11. 48 
13. 61 
8.30 
14.22 
15. 95 
9. 04 
15.73 
14 . 42 
14 . 97 
12.27 
9. 69 
14.48 
17.32 
16.00 
14.17 
13.52 
8.95 
12.78 
20.16 
17 .18 
15.86 
14 .62 
14 . 40 
16. 61 
19. 60 
19.26 
13.65 
14 .00 
12.21 
16.22 
19.90 
11.86 
19.60 
15.18 
13.04 
13.71 
16.56 
22.03 
16. 82 
17.96 
13.85 
17.60 
14.60 
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770 
771 
772 
773 
774 
775 
776 
777 
778 
779 
780 
781 
782 
783 
784 
785 
786 
787 
788 
789 
790 
791 
7 92 
793 
794 
795 
796 
797 
798 
799 
800 
801 
802 
803 
804 
805 
806 
807 
808 
809 
810 
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813 
814 
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816 
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818 
819 
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823 
824 
825 
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42.075 46.950 49.736 1.00 18.28 
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40. 009 
52.878 
39.380 
34.750 
17.366 
44.804 
31.146 
46.715 
42. 869 
40.831 
35.162 
14.809 
39.527 
27.757 
49.394 
44 .777 
57.867 
34.155 
45. 141 
43.475 
20.668 
32.083 
49. 913 
42.785 
39.850 
27.466 
40.190 
39.109 
27 .558 
66.163 
22.819 
25. 950 
34 .253 
59. 840 
38.496 
40.729 
56.117 
37.047 
42.476 
35.171 
29.225 
52.753 
53.307 
'11. 971 
24 . 990 
43. 951 
36.259 
35.820 
39. 436 
62.583 
19.790 
50. 697 
27.847 
27. 949 
55.965 
9. 432 
41. 473 
50. 992 
26.053 
69.827 



19.530 

28.814 

57. 861 

20.720 

31.093 

23. 924 

56.380 

56.401 

58.288 

33.557 

52.317 

39.247 

31. 879 

50. 041 

26.247 

26.399 

38.875 

51. 047 

27. 154 

51.366 

25.724 

32.320 

48.449 

27.563 

53. 7fi0 

13. 978 

34. 656 

10. 935 

21.410 

40. 174 

49. 816 

37.105 

10. 439 

33. 669 

20.697 

21.319 

41. 173 

26. 323 

46. 612 

28.146 

19. 649 

51.832 

37.708 

34.982 

48.789 

19.262 

14 . 934 

12. 642 

55.386 

34.233 

54.852 

29.832 

56.381 

17.733 

37.522 

31.570 

54 .399 

18.343 

14 .477 

44 .579 



28.487 
13.591 
40.522 
49.137 
38.464 
14.000 
49.556 
46.506 
37.166 
7.130 
73.577 
36.520 
53.125 
53.223 
11.787 
15.119 
52.198 
61. 985 
46.412 
28.087 
29.782 
15.051 
23.710 
47. 669 
46.033 
27. 601 
11.736 
33.519 
51.556 
37.797 
39.224 
65.162 
28.187 
44 . 222 
31. 902 
12.133 
25.206 
50. 890 
21.569 
10. 829 
23.377 
57. 912 
61.763 
48.268 
37.974 
20. 770 
14 .740 
25.488 
36.440 
30.419 
50.868 
12.471 
70.315 
50.193 
32. 909 
48.180 
62.185 
24 .030 
12.4 60 
41.736 



1.00 17.39 
1.00 22.84 
1.00 20.88 
1.00 24.09 
1.00 23.54 
1.00 19.88 
1.00 21.05 
1.00 19.22 
1.00 15.61 
1.00 24.79 
1.00 20.05 



1 

1. 

1. 

1. 

1. 



00 18.70 
00 13.72 
00 27.53 
00 19.81 
00 21.30 
1.00 18.47 
1.00 19.93 
1.00 22.42 
1.00 18.86 
1.00 24.21 
1.00 19.38 
1.00 18.63 
1.00 25.09 
1.00 22.72 
1.00 20.70 
1.00 32.79 
1.00 22.45 
1.00 17.72 



00 19.68 
00 23.29 
00 25.58 
00 25.36 
00 19.20 
00 16.24 
00 28.36 



1.00 19.73 
1.00 22.30 
1.00 21.24 
1.00 23.83 
1.00 20.44 
00 24.14 
00 20.90 



.00 25.05 
.00 24.58 
00 21.14 
00 22.42 
00 20. 65 
00 12.53 
00 22.07 
00 29.06 
00 22.39 



1.00 23.46 
1.00 31.12 
1.00 17.65 
1.00 32.33 
1.00 30.40 
1.00 25.41 
1.00 20.39 
1.00 22.45 
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67.822 43.137 
28 . 879 64 . 576 
50.121 23.822 
50.504 23.835 
50.897 18.482 
54.681 37.002 
23.493 23.940 
33.356 54.985 
43.404 31.760 
28.231 42.794 
14.847 43.548 
14.659 39.207 
57.510 37.601 
20.651 32.033 
52.056 50.326 
51.657 35.863 
47.279 21.024 
29.576 44.197 
40.524 18.980 
56.572 42.868 
34.210 18.018 
27.258 18.647 
62.773 54.373 
35.089 9.269 
38.082 57.888 
20.570 20.572 
64 .060 48.729 
61.993 33.237 
46.176 15.310 
14.968 50.857 
21.704 24.366 
39.839 19.775 
44.365 68.561 




16.664 1.00 27.42 

49.712 1.00 24.06 

51.058 1.00 19.06 

59.470 1.00 23.51 

64J227 1.00 34.99 

42.450 1.00 37.42 

12.538 1.00 29.15 

51.137 1.00 26.94 

37.851 1.00 23.28 

58.259 1.00 30.21 

50.490 1.00 30.20 

32.693 1.00 26.12 

54.146 1.00 26.40 

54.478 1.00 19.11 

32.080 1.00 20.99 

44.501 1.00 22-. 93 

67.124 1.00 40.66 

37.265 1.00 42.07 

57.585 1.00 24.15 

32.120 1.00 36.71 
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26.109 1.00 26.96 

39.531 1.00 36.25 

50.485 1.00 38.55 

33.414 1.00 23.60 

35.734 1.00 30.32 

47.598 1.00 25.68 

54.293 1.00 38.10 

31.633 1.00 32.63 

53.881 1.00 42.97 

51.887 1.00 36.24 

14.153 1.00 24.51 

56.122 1.00 31.90 
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47.553 30.301 39.926 1.00 25.30 

32.268 20.759 14.108 1.00 23.29 
69.343 41.988 48.078 1.00 32.22 
24.297 15.185 41.737 1.00 31.91 
30.655 11.104 25.446 1.00 26.78 
42.362 40.299 57.293 1.00 20.70 
41.774 36.340 13.554 1.00 34.25 
39.152 38.465 60.315 1.00 25.51 
41.147 24.781 9.266 1.00 37.67 
69.407 36.691 46.483 1.00 26.53 
55.637 47.942 26.959 1.00 32.37 
33.913 34.460 58.201 1.00 38.45 

56.269 58.664 39.628 1.00 38.97 
57.309 45.050 24.747 1.00 30.45 
67.831 30.631 34.349 1.00 30.40 
17.422 51.937 47.071 1.00 26.07 
35.633 30.975 66.612 1.00 24.07 
61.975 46.612 53.486 1.00 26.87 
47.029 38.251 67.425 1.00 30.22 
60.210 33.638 26.728 1.00 31.35 
17.482 36.558 63.602 1.00 44.50 
24.128 36.324 26.146 1.00 20.37 
15.719 30.346 47.149 1.00 24.17 
23.359 62.099 63.265 1.00 40.09 
50.740 48.721 64.156 1.00 39.37 
64.987 41.178 33.243 1.00 37.88 
47.126 55.164 71.884 1.00 33.61 
15.488 43.883 65.882 1.00 23.44 
37.706 50.611 24.676 1.00 33.21 
37.379 23.810 9.516 1.00 34.84 
38.957 20.044 55.559 1.00 43.10 
24.959 16.071 14.898 1.00 34.26 
22.429 16.629 37.874 1.00 31.87 
29.356 53.921 43.835 1.00 38.69 
60.523 35.360 57.102 1.00 40.12 
29.303 20.134 16.704 1.00 26.85 
19.799 33.303 59.063 1.00 32.03 
19.367 18.997 32.033 1.00 30.19 
48.165 22.382 58.513 1.00 36.12 
43.741 63.487 53.233 1.00 33.28 
33.863 20.218 11.695 1.00 26.21 
49.235 59.976 52.718 1.00 28.98 
63.989 32.978 44.987 1.00 35.98 
38.155 21.333 49.259 1.00 27.87 
28.080 14.345 39.095 1.00 31.98 
15.034 38.648 61.496 1.00 35.94 
58.771 42.322 58.976 1.00 26.19 
47.485 42.154 69.878 1.00 35.48 
26.602 32.519 69.106 1.00 23.36 
28.022 17.338 22.711 1.00 31.24 
26.711 49.956 71.239 1.00 ^8.80 
64.021 22.891 42.685 1.00 33.14 
47.677 33.744 7.930 1.00 25.60 
34.908 23.405 49.212 1.00 24.56 
46.968 46.697 68.856 1.00 35.00 
41.692 11.412 30.635 1.00 31.37 
34.423 48.837 63.936 1.00 25.42 
38.612 61.793 51.417 1.00 33.87 
45.067 50.306 71.708 1.00 44.52 
46.249 23.672 42.556 1.00 31.95 
22.948 57.447 63.178 1.00 39.30 
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53.305 20.101 34.868 1.00 41.13 
39 320 63.985 53.074 1.00 28.86 
20.574 21.558 47.751 1.00 30.99 
70.490 44.479 44.539 1.00 22.23 
13.638 23.050 34.936 1.00. 28.58 
61.304 38.113 59.657 1.00 31.77 
70.418 49.240 43.244 1.00 47.36 
27.272 13.615 45.529 1.00 35.16 
40.595 52.830 28.676 1.00 35.25 
41.406 39.978 15.308 1.00 23.02 
47.358 18.488. 60.102 1.00 48.76 
30.061 44.962 27.407 1.00 28.02 
69.978 22.947 32.361 1.00 35.17 
54.106 56.970 32.477 1.00 27.90 
48.115 18.063 53.954 1.00 37.64 
38.320 16.416 14.841 1.00 28.53 
29.037 12.266 32.860 1.00 24.29 
29.314 35.536 60.124 1.00 39.44 
47.444 23.555 12.870 1.00 36.87 
40.389 49.471 57.522 1.00 28.51 
47.998 48.009 65.323 1.00 45.77 
38.192 25.689 48.335 1.00 36.54 
22.607 20.057 46.747 1.00 33.16 
37.372 23.697 52.106 1.00 21.54 
33.554 55.985 48.705 1.00 26.17 
48.510 43.090 21.681 1.00 30.35 
47.263 49.883 23.397 1.00 41.75 
19.519 18.617 44.162 1.00 42.73 
37.700 6.935 33.344 1.00 44.09 
44.887 20.648 40.251 1.00 45.35 
67,007 44.355 29.244 1.00 44.80 
65.458 35.373 34.954 1.00 44.52 
30.989 20.633 65.111 1.00 35.79 
26.918 36.574 22.781 1.00 35.02 
19.103 25.427 48.338 1.00 25.31 
47.801 58.546 30.861 1.00 32.01 
33.665 57.659 31.778 1.00 36.66 
60.170 48.247 52.476 1.00.32.47 
63.978 36.551 52.728 1.00 34.68 
29.804 56.958 72.469 1.00 49.60 
9.989 39.449 62.826 1.00 44.68 
36.235 28.967 7.281 1.00 44.34 
49.505 50.913 30.172 1.00 29.44 
11.062 37.216 34.519 1.00 29.29 
44.702 22.987 39.107 1.00 36.93 
50.857 38.850 67.888 1.00 36.37 
60.252 35.317 28.988 1.00 29.45 
20.726 30.784 57.864 1.00 29.69 
20.309 29.029 31.082 1.00 23.76 
21.059 52.048 61.212 1.00 28.93 
44.854 20.536 43.319 1.00 44.76 
19.996 28.678 33.868 1.00 30.03 
33 822 36.086 54.053 1.00 31.99 
53.120 30.359 35.501 1.00 32.80 
54 882 53.339 33.303 1.00 30.28 
55.781 39.618 64.945 1.00 34.34 
48.371 45.507 17.871 1.00 36.20 
66.334 38.117 49.319 1.00 23.90 
16 758 47.920 39.004 1.00 37.89 
54.280 18.755 56.686 1.00 52.80 
31.023 49.906 52.390 1-00 45.47 
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12.055 
31.945 
13.807 
55.521 
17.175 
36.313 
48.801 
40.208 
14.148 
38.755 
62.000 
20.078 
63.279 
35. 585 
29.682 
46.780 
44.882 
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25.981 
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54.501 
25. 680 
57.746 
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9.889 
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30.232 
56.840 
70.126 
46. 992 
49.137 
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30.591 
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55.459 
64 .734 
47.784 
53.971 
46. 637 
63.372 
26.222 
39.141 
59.271 
55. 454 
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42. j06 
56.720 
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61. 887 
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Sequences 
Seq ID No. 1 
WT-ACE 

SQQVTVTHGTSSQATTSSQTTTHQATAHQTSAQSPNLVTDEAEASKFVEEYD 
RTS Q VVWNEYAEANWNYNTNITTET^ 

WQLQNTTIKiaiKKVQDLERAALPAQELEEYNKILLDMETTYSVATVCOT 

SCLQLEPDLTNVMATSPJCYEDLLWAWEGWPvDKAGRAILQFYPKYVELINQA 

APXNGYVDAGDSWRSMYETPSLEQDLEPXFQELQPLYLNLHAYVRRALHRH 

YGAQHINLEGPIPAHLLGNMWAQWSNrYDLWPFPSAPSMDTTEAMLKQG 

WTPRRMFKEADDFFTSLGLLPWPEFWNKSMLEKPTDGREVVCHASAWDFY 

NGKDFRIKQCTTVNLEDLWAHHEMGfflQYFMQYKJDLPVALREGANPGFHE 

AIGDVLALSVSTPKHLHSLNLLSSEGGSDEHDINFLMKMALDKIAFIPFSYLVD 

QWRWRWDGSITKENYNQEWWSLPvLKYQGLCPPVPRTQGDFDPGAKFHIPS 

SWYIRYFVSFnQFQFHEALCQAAGHTGPLHKCDIYQSKEAGQRLATAMKXG 

FSRPWPEAMQLITGQPNMSASAMLSYFKPLLDWLRTENELHGEKLGWPQYN 

WTPNSARSEGPLPDSGRVSFLGLDLDAQQARVGQWLLLFLGIALLVATLGLS 

QRLFSIRHRSLHRHSHGPQFGSEVELRHS 



Seq ID No. 2 
ACE-A36NJ 

LVTDEAEASKEVEEYDRTSQWWNEYAEANWNYNTNITTETSKILLQKNMQI 

ANHTLKYGTQARKFDWQLQNTnKRIIKKVQDLERAALPAQELEEYNKILLD 

METTYSVATVCHPNGSCLQLEPDLTmn^ATSRKYEDLLWAWEGWRDKAGR 

AILQFYPKYVELINQAARLNGYVDAGDSWRSMYETPSLEQDLERLFQELQPL 

YLNEHAYVRRALHRHY GAQHINLEGP IP AHLLGNMW AQTWSNI YDLVVPFP 

SAPSMDTTEAMLKQGWTPRRMFKEADDFFTSLGLLPVPPEFWNKSMLEKPT 

DGREWCHASAWDFYNGKDFRnCQCTTVNLEDLVVAHHEMGHIQYFMQYK 

DLPVALREGANPGFHEAIGDVLAESVSTPKHLHSLNLLSSEGGSDEHDINFLM 

BOS^DKIAFIPFSYLVDQWRWRVFDGSITKENYNQEWWSLRLKYQGLCPPV 

PRTQGDFDPGAKFHIPSSVPYIRYFVSFnQFQFHEALCQAAGHTGPLHKCDIY 

QSKEAGQRLATAMKLGFSRPWPEAMQLITGQPNMSASAMLSYFKPLLDWLR 

TE 

NELHGEKLGWP Q YNWTPNS 
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CLAIMS 

1 . A crystal of ACE protein. 

2. A crystal according to claim 1 wherein the ACE protein is underglycosjdated. 

3. A crystal according to claim 2 wherein the ACE protein is underglycosylated 
by removing one or more glycosylation sites and/or one or more partially glycosylated 
sites. 

4. A crystal according to claim 3 wb -rein the underglycosylated ACE protein 
comprises a mutation at amino acid 337 or amino acid 90, 109, 155, 337 and 586 of 
SEQ ED No 2. 

5. A crystal according to any one of the preceding claims comprising atoms 
arranged in a spatial relationship represented by at least a portion of the structure co- 
ordinates of Table A or Table B. 

6. A crystal according to any one of the preceding claims wherein the crystal 
belongs to the space group P2i2j2i . 

7. A crystal according to any one of the preceding claims having unit cell 
dimensions of: a= 56.47 A, b=84.90 A, c=l 33.99 A. 

8. A crystal according to any one of the preceding claims wherein the crystal is a 
crystal of human ACE protein. 

9. A crystal according to any one of the preceding claims wherein the crystal 
further comprises an entity bound to the ACE protein or a portion thereof. 



10. A crystal according to claim 9 wherein the entity is bound to the ACE protein 
or a portion thereof by contacting one or more residues of the ACE protein selected 
from: His384, Ala385, Lys542, Tyr551, Tyr554, Glu415 and His544. 
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11. A crystal according to claim 9 or claim 10 wherein the entity modulates the 
activity of ACE. 

12. A crystal according to claim 1 1 wherein the entity is an inhibitor of ACE. 

13. A crystal according to claim 12 wherein the inhibitor of ACE is lisinopril or a 
derivative thereof. 

14. A crystal according to claim 13 comprising atoms arranged in a spatial 
relationship represented by at least a portion of the structure co-ordinates of Table B. 

15. A method of preparing a crystal of ACE protein comprising the steps of: 

(a) culturing host cells comprising an underglycosylated ACE protein; 

(b) purifying the underglycosylated ACE protein; and 

(c) crystallising the underglycosylated ACE protein. 

16. A method according to claim 15 wherein the ACE protein is 
underglycosylated by removing one or more glycosylation sites and/or one or more 
partially glycosylated sites. 

17. A method according to claim 15 or claim 16 wherein the underglycosylated 
ACE protein comprises a mutation at amino acid 337 of SEQ ED No 2 or amino acids 
90, 109, 155,337and.586 of SEQ ID No 2. 

18. A method according to any of claims 15 to 17 wherein the ACE protein is 
crystallised using about 10 mM HEPES and about 0.1% PMSF with an equal volume 
of a reservoir solution containing about 15 % PEG 4000, about 50 mM 
CH 3 COONa.3H 2 0 pH 4.7 and about 10 uM ZnS0 4 .7H 2 0. 

19. A method according to any of claims 15 to 18 wherein me crystal that is 
prepared has a structure defined by at least a portion of the structure co-ordinates of 
Table A. 
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20. A method according to any of claims 15 to 19 wherein the crystal belongs to 
the space group P2i2j2j. 

21. A method according to any of claims 15 to 20 wherein the crystal has the unit 
cell dimensions: a= 56.47 A, b=84.90 A and c=133.99 A. 

22. A method according to any of claims 15 to 21 wherein the ACE protein is 
human ACE protein. 

23. A method according to any of claims 15 to 22 wherein the ACE protein is 
crystallised in the presence of an entity. 

24. A method ac^c -ding to claim 23 wherein the entity is a modulator of ACE. 

25. A method according to claim 24 wherein the entity is an inhibitor of ACE. 

26. A method according to claim 25 wherei n the inhibitor of ACE is lisinopril or a 
derivative thereof. 

?7. A method according to claim 26 wherein the crystal that is prepared h?s a 
structure defined by at least a portion of the structure co-ordinates of Table. B. 

28. A method of serening for a modulator of ACE wherein the method comprises 
the use of a crystal according to any of claims 1-14. 

29. A method according to claim 28 comprising the steps of: 

(a) providing at least a portion of the structure co-ordinates of Table A or Table B; 



(b) employing at least a portion of the structure co-ordinates of Table A or Table B to 
design or select or synthesise a putative modulator of ACE; 
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(c) contacting the putative modulator of ACE with ACE or a mutant, variant, 
homologue, derivative or fragment thereof in the presence of a substrate; and 

(d) screening the putative modulator of ACE in an assay for the potential to modulate 
ACE. 

30. A method according to claim 29 wherein at least a portion of the structure co- 
ordinates of Table A or Table B and/or the putative modulator of ACE and/or the 
substrate are provided on a machine-readable data storage medium comprising a data 
storage material encoded with machine readable data. 

31. A method according to claim 29 or claim 30 wherein the putative ACE 
modulator is from a library of compounds. 

32. A method according claim 29 or claim 30 wherein the putative ACE modulator 
is selected from a database. 

33. A method according to claim 29 or claim 30 wherein the putative ACE 
modulator is designed de novo. 

34. A method according to claim 29 or claim 30 wherein the putative ACE 
modulator is designed from a known ACE modulator. 

35. A method according to claim 29 or claim 30 wherein the design or selection of 
, the putative ACE modulator is performed in conjunction with computer modelling. 

36. A method according to any of claims 28 to 35 wherein the ACE modulator is 
useful in the prevention and/or treatment of an ACE related disorder. 

37. A method according to claim 36 wherein the ACE related disorder is 
hypertension. 



38. A process comprising the steps of: 




(a) performing the method according to any of claims 28 to 3 6; 

(b) identifying one or more modulators of ACE; and 

(c) preparing a quantity of those one or more ACE modulators. 

39. A process comprising the steps of: 

(a) performing the method according to any of claims 28 to 36; 

(b) identifying one or more ACE modulators; and 

(c) preparing a pharmaceutical composition comprising those one or more identified 
ACE modulators. 

40. A process comprising the steps of: 

(?) performing the method according to any of claims 28 to 36; 

(b) identifying one or more ACE modulators; 

(c) modifying those one or more ACE modulators; and 

(d) optionally preparing a pharmaceutical composition comprising those one or 
more ACE modulators. 

41. A method of obtaining structural information about a molecule or a molecular 
complex of unknown structure by using ut least a portion of the structure co-ordinates of 
ACE, comprising the steps of: 

(a) generating X-ray diffraction data from a crystallised molecule or molecular 
complex; 
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(b) applying at least a portion of the structure co-ordinates of ACE to said X-ray 
diffraction pattern to generate a three dimensional electron density map of at least 
a portion of the molecule or molecular complex; and 

5 (c) using all or a portion of the structure co-ordinates of ACE to generate homology • 
models of ACE. 

42. An ACE modulator identified by the method of any one of claims 28 to 36. 
10 43. An ACE modulator according to claim 42 wherein the ACE modulator inhibits 



44. A pharmaceutical composition comprising an ACE modulator according to 
claim 42 or claim 43 and a pharmaceutical^ acceptable carrier, diluent, excipient or 

15 adjuvant or any combination thereof. 

45. A method of preventing and/or treating an an ACE related disorder comprising 
administering a modulator of ACE according to claim 42 or claim 43 and/or a 
pharmaceutical according to claim 44 wherein said modulator of ACE or said 

20 pharmaceutical is capable of causing a beneficial preventative and/or therapeutic 



46. A computer for producing a three-dimensional representation of ACE wherein 
said computer comprises: 



(a) a computer-readable data storage medium comprising a data storage material 
encoded with computer-readable data, wherein said data comprises the structure co- 
ordinates of ACE; 

30 (b) a working memory for storing instructions for processing said computer-readable 
data; 



ACE. 



effect. 



25 



(c) a central-processing unit coupled to said working memory and to said computer- 
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readable data storage medium for processing said computex-machine readable data 
into said three-dimensional representation; and 

(d) a display coupled to said central-processing unit for displaying said three- 
dimensional representation. 

47. A machine-readable data storage medium comprising a ds f a storage material 
encoded with machine readable data, wherein the data is defined by at least a portion 
of the structure co-ordinates of ACE in Table A or Table B. 

48. Use of an ACE crystal in the preparation of a medicament to prevent and/or 
treat an ACE related disorder. 

49. Use according to claim 48 wherein the ACE related disorder is hypertension. 

50. Use of at least a portion of the structure co-ordinates of Table A or Table B to 
screen for modulators of ACE. 

51 . Use of at least a portion of the structure co-ordinates of Table A or Table B to 
rxlve the structure of the crystalline form of any other protcui with significant amino 
rcid :; "qucnce homology to ar y functional domain of ACE. 

52. Use of at least a portion of the structure co-ordinates of Table A or Table B in 
molecular design techniques to design, select and synthesise modulators of ACE. 

53 . Use of at least a portion of the structure co-ordinates of Table A or Table B in 
the development of compounds that can isomerise to reaction intermediates in the 
chemical reaction of a substrate or other compound that binds to ACE. 



54. Use of at least a portion of the structure co-ordinates of Table A or Table B to 
screen small molecule databases for chemical entities or compounds that modulate 
ACE. 
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55. ' Use of at least a portion of the structure co-ordinates of Table A or Table B to 



solve the structure of the crystalline form of any other protein with significant amino 
acid sequence homology to any functional domain of ACE. 

56. Use according to claim 55 wherein the structure of the crystalline form of any 
other protein with significant amino acid sequence homology to any functional 
domain of ACE is solved using molecular replacement. 



57. pLEN- tACEA36g(l, 2, 3, 4). 



58. P LEN-tACEA36g(l,3). 




ABSTRACT 



CRYSTAL 



The present invention relates to a crystal of ACE protein. The present invention 
further relates to methods, processes, ACE modulators, pharmaceutical compositions 
and uses of the ACE crystal and the structure co-ordinates thereof. 
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